Abstract

We show that it is quasi-NP-hard to color 2-colorable 8-uniform hypergraphs with
$2^{(\log N)^{1/10-o(1)}}$ colors, where $N$ is the number of vertices. There has been much focus
on hardness of hypergraph coloring recently. In [17], Guruswami, Håstad, Harsha,
Srinivasan and Varma showed that it is quasi-NP-hard to color 2-colorable 8-uniform
hypergraphs with $2^{2^{\Omega(\sqrt{\log\log N})}}$ colors. Their result is obtained by composing
standard Label-Cover with an inner-verifier based on Low-Degree-Long-Code,
using Reed-Muller code testing results by Dinur and Guruswami [12]. Using a different
approach in [29], Khot and Saket constructed a new variant of Label-Cover, and
composed it with Quadratic-Code to show quasi-NP-hardness of coloring 2-colorable
12-uniform hypergraphs with $2^{(\log N)^c}$ colors, for some $c$ around 1/20. Their
construction of Label-Cover is based on a new notion of superposition complexity for CSP
instances. The composition with inner-verifier was subsequently improved by Varma,
giving the same hardness result for 8-uniform hypergraphs [37].

Our construction uses both Quadratic-Code and Low-Degree-Long-Code, 
and builds upon the work by Khot and Saket. We present a different approach to 
construct CSP instances with superposition hardness by observing that when the 
number of assignments is odd, satisfying a constraint in superposition is the same as 
odd-covering a constraint. We employ Low-Degree-Long-Code in order to keep 
the construction efficient. In the analysis, we also adapt and generalize one of the key 
theorems by Dinur and Guruswami [12] in the context of analyzing probabilistically
checkable proof systems. 


1 Introduction 

For an integer $k \ge 2$, a $k$-uniform hypergraph $H = (V, E)$ consists of a vertex set $V$ and
an edge set $E \subseteq \binom{V}{k}$. A set of vertices $S \subseteq V$ is an independent set if for all
$e \in E$, $e \not\subseteq S$, i.e., no edge is completely inside $S$. A hypergraph is $q$-colorable if
its vertices can be partitioned into $q$ disjoint independent sets.

We use $\alpha(H)$ to denote the fractional size of the maximum cardinality independent set
of $H$, also known as the fractional independence number, and we use $\chi(H)$ to denote the
minimum $q$ such that $H$ is $q$-colorable. It is easy to verify that we have $\chi(H)\,\alpha(H) \ge 1$ for
any $H$, since the largest color class in an optimal coloring contains at least a $1/\chi(H)$
fraction of the vertices.

In the ordinary graph case, corresponding to $k = 2$, deciding whether a graph $G$ has a
2-coloring is the same as deciding whether it is a bipartite graph, and can be easily solved 
in polynomial time. In general, however, determining the chromatic number of a graph 
exactly is NP-hard [16]. In fact, even coloring 3-colorable graphs with 4 colors is NP-hard. 
For general $q$-colorable graphs, it is NP-hard to color with $q + 2\lfloor q/3 \rfloor - 1$ colors.
For sufficiently large $q$, it was shown that it is NP-hard to color a $q$-colorable graph with
$2^{\Omega(q^{1/3})}$ colors [22], improving on an earlier lower-bound of $q^{\frac{\log q}{25}}$ by Khot. Assuming
a variant of Khot's 2-to-1 Conjecture, Dinur, Mossel and Regev [13] proved that it is
NP-hard to $q'$-color a $q$-colorable graph for any $3 \le q < q'$. The dependency between the
hardness of graph coloring and the parameters of 2-to-1 Label-Cover was made explicit
and improved by Dinur and Shinkar, who showed that it is NP-hard to $(\log n)^c$-color
a 4-colorable graph for some constant $c > 0$ assuming the 2-to-1 Conjecture. As for
algorithms, there have been many results as well [38l El EH [9| . For 3-colorable graphs, the 
best algorithm is by Kawarabayashi and Thorup [25], which uses $O(n^{0.19996})$ colors, based
on results by Arora and Chlamtac [3], Chlamtac, and the earlier work of Kawarabayashi
and Thorup [23]. As we can see, there is still a huge gap between the best approximation 
guarantee and the best hardness result. 

For $k \ge 3$, even determining whether a $k$-uniform hypergraph has a 2-coloring is NP-hard.
In terms of approximation algorithms, the best algorithm for 2-colorable 3-uniform
hypergraphs still requires colors [321 [mini. 

From the hardness side, the first super-constant hardness result was proved in [18]. The
main result there is that for 2-colorable 4-uniform hypergraphs, finding a coloring with any
constant number of colors is NP-hard, and finding a coloring with $O(\log\log n/\log\log\log n)$
colors is quasi-NP-hard. For 2-colorable 3-uniform hypergraphs, a similar hardness result
was also proved. Khot [28] proved that coloring 3-colorable 3-uniform hypergraphs
with any constant number of colors is hard, and for $q$-colorable 4-uniform hypergraphs,
coloring with $(\log n)^{\Omega(q)}$ colors is quasi-NP-hard for $q \ge 7$. The analysis in [18] was
improved by Holmerin, who proved that even finding an independent set of fractional size
$\Omega(\log\log\log n/\log\log n)$ is quasi-NP-hard [21]. The construction was further improved
recently by Saket, who proved that it is quasi-NP-hard to find independent set of size
n/(log in 2-colorable 4-uniform hypergraphs [36] . There has also been work on the 

hardness of finding independent sets in almost 2-colorable hypergraphs, that is, hypergraphs
that become 2-colorable after removing a small fraction of vertices. A much stronger result
is known, albeit at the cost of imperfect completeness. We refer to m for more details. 

Recently, in [17], Guruswami, Harsha, Håstad, Srinivasan and Varma proved the first
super-polylogarithmic hardness result for hypergraph coloring, showing hardness for
coloring 2-colorable 8-uniform hypergraphs with $2^{2^{\Omega(\sqrt{\log\log n})}}$ colors. Their reduction uses the
Low-Degree-Long-Code proposed in [7], based on techniques for testing Reed-Muller
codes developed in [12].

Using a very different approach, Khot and Saket gave another exponential improvement
in [29], showing quasi-NP-hardness for coloring 2-colorable 12-uniform hypergraphs with
$\exp((\log n)^{\Omega(1)})$ colors, where the constant in $\Omega(1)$ is around 1/20, although it might

be improved with a more careful analysis of their reduction. Part of their analysis was 
subsequently simplified by Varma in m using ideas from m- 

In this work, we give another improvement for hardness of hypergraph coloring. Our 
main result is as follows. 

Theorem 1.1. It is quasi-NP-hard to color a 2-colorable 8-uniform hypergraph of size $N$
with $2^{(\log N)^{1/10-o(1)}}$ colors.

1.1 Proof Overview 

We start by describing the PCP reduction of proving hypergraph coloring hardness used 
in many previous works. Most of these results show hardness of finding an independent set 
of large fractional size. We can view the output of these reductions as NotAllEqual$_k$
CSP instances. The variables correspond to the vertices of a hypergraph, and the
NotAllEqual$_k$ constraints correspond to the hyperedges. Note that for hypergraph coloring
results, all variables appear positively in such instances and no negations are allowed. An
assignment that satisfies all the NotAllEqual$_k$ constraints thus gives a perfect 2-coloring
for the hypergraph. In the other direction, a set of vertices in the hypergraph naturally
corresponds to a $\{0,1\}$ assignment to the variables in the NotAllEqual$_k$ instance, and
the vertices form an independent set if for all constraints in the NotAllEqual$_k$ instance,
there is at least 1 variable that is assigned 0.
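
The correspondence between NotAllEqual constraints, 2-colorings and independent sets can be illustrated with a minimal Python sketch. The toy hypergraph `EDGES` and all function names below are our own illustrative assumptions, and the brute-force search is only meant for tiny instances.

```python
from itertools import combinations, product

def is_proper_2_coloring(edges, coloring):
    """True iff every hyperedge sees both colors (all NotAllEqual constraints hold)."""
    return all(len({coloring[v] for v in e}) > 1 for e in edges)

def is_independent(edges, subset):
    """True iff no hyperedge lies completely inside `subset`."""
    s = set(subset)
    return all(not set(e) <= s for e in edges)

def find_2_coloring(n, edges):
    """Brute-force search over all {0,1} assignments (exponential time)."""
    for bits in product((0, 1), repeat=n):
        if is_proper_2_coloring(edges, bits):
            return bits
    return None

# Hypothetical toy instance: a 3-uniform hypergraph on 5 vertices.
EDGES = [(0, 1, 2), (1, 2, 3), (2, 3, 4), (0, 3, 4)]
COLORING = find_2_coloring(5, EDGES)
```

Each color class of a proper 2-coloring is an independent set, which is the direction used when a coloring hardness result is derived from an independent set hardness result.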

The starting point of the reduction is usually some Label-Cover hardness. We then 
encode the supposed labeling for the Label-Cover instance with some coding scheme, 
and design a PCP to test the consistency of the labeling. 

One classical choice of encoding is the Long-Code, which encodes $m$ bits of information
with $2^{2^m}$ bits. This huge blowup makes it impossible to prove hardness results better
than $\mathrm{poly}\log n$ via the Label-Cover plus Long-Code approach.

A much more efficient encoding is the Hadamard code, which only uses $2^m$ bits to
encode m bits of information. However, the disadvantage of the Hadamard code is that 
one can only enforce linear constraints on the codewords, which means that we can only 
start from hard problems involving only linear constraints, and as a result, we lose perfect 
completeness and can only prove results about almost coloring. 

The Low-Degree-Long-Code proposed in [7] lies somewhere between Long-Code 
and Hadamard code. We can view Hadamard code as encoding m bits by writing down the 
evaluation of all m-variable functions of degree at most 1 on these m bits, and Long-Code 
as writing down the evaluation of all possible m-variable functions — that is, degree up 
to m — on these m bits. Low-Degree-Long-Code has a parameter d, the degree, and 
the encoding writes down the evaluation of all polynomials of degree at most d. Dinur 
and Guruswami [12] obtained a hardness result for a variant of hypergraph coloring based
on Low-Degree-Long-Code, and the techniques were soon adapted in [17] to get a
hardness result of $2^{2^{\Omega(\sqrt{\log\log n})}}$.
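
To make the comparison between the three encodings concrete, the following small Python sketch counts the positions of a degree-$d$ encoding of $m$ bits. The function name is our own. Note that the count includes the constant monomial, so $d = 1$ gives exponent $m + 1$ rather than $m$; the plain Hadamard code corresponds to dropping the constant term.

```python
from math import comb

def log2_table_size(m, d):
    """log2 of the number of polynomials of degree at most d in m variables over F_2.

    This equals the number of multilinear monomials of degree at most d,
    since each monomial is either present or absent in a polynomial.
    """
    return sum(comb(m, i) for i in range(d + 1))

# d = m recovers the Long-Code exponent 2^m; d = 2 is the Quadratic-Code case.
LONG_CODE_EXPONENT = log2_table_size(10, 10)   # 2^10 monomials
QUADRATIC_EXPONENT = log2_table_size(10, 2)    # 1 + 10 + 45 monomials
```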

The aforementioned result by Khot and Saket [29] uses Quadratic-Code, which is
the same as Low-Degree-Long-Code with $d = 2$. Their construction, however, is
completely different from that in [17].

One can view the Quadratic-Code used in [29] as the Hadamard encoding of a matrix
$M$ that is symmetric and has rank 1, that is, there exists some $u \in \mathbb{F}_2^m$ such that
$M = u \otimes u$. Khot and Saket described a 6-query test such that if some encoding function
$f : \mathbb{F}_2^{m \times m} \to \mathbb{F}_2$ passes the test with non-trivial probability, then we can decode it into a
low rank matrix.

In order to use this encoding, it seems natural that one would like to construct some 
variant of Label-Cover where the labels are now matrices, with some linear constraints 
on the entries of the matrices (since as discussed above we are using Hadamard code 
to encode the matrices). In the completeness case, we would like to have some matrix 
labelings of rank 1 that satisfy all linear constraints on the vertices as well as projection
constraints on the edges, and in the soundness case, not even labelings with low rank 
matrices can satisfy more than a small fraction of them. 

Such a Label-Cover hardness result is not readily available. Khot and Saket proposed
the notion of superposition complexity for quadratic equations. Briefly speaking, let
$q(x) = c + \sum_{i=1}^{m} c_i x_i + \sum_{1 \le i < j \le m} c_{ij} x_i x_j = 0$ be a quadratic equation on $m$ $\mathbb{F}_2$-variables. We say
that $t$ assignments $a^{(1)}, \dots, a^{(t)} \in \mathbb{F}_2^m$ satisfy the equation $q(x) = 0$ in superposition if

$$c + \sum_{i=1}^{m} c_i \left(\sum_{l=1}^{t} a_i^{(l)}\right) + \sum_{1 \le i < j \le m} c_{ij} \left(\sum_{l=1}^{t} a_i^{(l)} a_j^{(l)}\right) = 0.$$

If we have a system of quadratic equations, then we say that $t$ assignments satisfy the
system of quadratic equations in superposition if each quadratic equation is satisfied in
superposition. Having a small number of assignments satisfying quadratic constraints in
superposition is exactly the same as having a symmetric low-rank matrix satisfying the
linearized version of the constraints, as we discuss in more detail in Section 2.

Through a remarkable chain of reductions, Khot and Saket established the inapproximability
of quadratic equations with superposition complexity, as well as the actual construction
of the Label-Cover with matrix labels. They started with superposition hardness
for E3-Sat with gap of 1/n, and used low-degree testing and the sum-check protocol, as
in the original proof of the PCP theorem, to achieve a superposition hardness result
for systems of quadratic equations with good soundness and moderate blowup in size.
This is then followed by a Point versus Ruled Surface test which produces the actual
Label-Cover instance.

The main contribution of this work is a much simpler and more efficient construction 
of systems of quadratic equations with superposition and approximation hardness. This 
is then coupled with a slight variant of the Point versus Ruled Surface test used by Khot 
and Saket to obtain hardness for Label-Cover with matrix labels. 

Let t be some odd natural number. A set of t assignments odd-covers an equation 
(or more generally, a constraint) if the number of assignments that satisfy the equation is 
odd. We show in Section 2 that the notion of odd-covering is equivalent to satisfaction
in superposition when the number of assignments is odd. This viewpoint enables us to 
construct the kind of Label-Cover instance used in [29] very easily. In fact, the reduction 
in Section 3 looks very much like a classical CSP inapproximability proof.

Although simpler, the above observation alone is not sufficient to give us a hardness 
result better than [29]. The issue here is that for the reduction in Section 3 to work for
our choice of parameters, the soundness of the Label-Cover that we start with needs 
to be sub-constant, and a typical Long-Code reduction will again blow up the size of 
the instance by too much. Hence, for this step, we employ Low-Degree-Long-Code. 
Our technical contribution here is Theorem 2.27, a generalization of the Reed-Muller code
testing result of [12].

2 Preliminaries 

Before we discuss the relation between superposition, odd-covering and low rank matrices, 
we define an operation on vectors and matrices that we will use frequently. 

Definition 2.1. Define $D_1 : \mathbb{F}_2^{m+1} \to \mathbb{F}_2^m$ as the operator that removes the first coordinate
of a vector. Define $D_1$ similarly for matrices as the operator that removes the first row
and column of a given matrix.

2.1 Superposition and Odd-Covering 

Khot and Saket [29] defined the notion of satisfying in superposition as follows.

Definition 2.2 (Superposition). Let $a^{(1)}, \dots, a^{(t)} \in \mathbb{F}_2^m$ be $t$ assignments and $q(x) = 0$ be
a quadratic equation in $m$ $\mathbb{F}_2$-variables with

$$q(x) = c + \sum_{i=1}^{m} c_i x_i + \sum_{1 \le i < j \le m} c_{ij} x_i x_j .$$

We say that the $t$ assignments satisfy the equation $q(x) = 0$ in superposition if

$$c + \sum_{i=1}^{m} c_i \left(\sum_{l=1}^{t} a_i^{(l)}\right) + \sum_{1 \le i < j \le m} c_{ij} \left(\sum_{l=1}^{t} a_i^{(l)} a_j^{(l)}\right) = 0 .$$


Definition 2.3. Given a system of quadratic equations $\{q_i(x) = 0\}_i$ on variables $x_1, \dots, x_m$,
its superposition complexity is the minimum number $t$, if it exists, such that there are $t$
assignments $a^{(1)}, \dots, a^{(t)}$ that satisfy each equation $q_i(x) = 0$ in superposition.

We define the odd superposition complexity (or even superposition complexity) to be
the minimum odd integer $t$ (or even integer $t$, respectively) such that there are $t$ assignments
that satisfy all equations in superposition.


Note that by simply adding the all-0 assignment, we can argue that the above three notions
of superposition complexity differ by at most 1.

We now explain the relation between superposition complexity of quadratic equations 
and low rank matrices. Assume for simplicity of exposition that the quadratic equation 
q{x) = 0 as defined above is homogeneous, that is, the constant term c and the linear 
coefficients $c_i$ are all 0.

We can express a homogeneous quadratic equation $q(x) = 0$ with a matrix by defining
$C \in \mathbb{F}_2^{m \times m}$, where $C_{ij} = c_{ij}$ for $1 \le i < j \le m$, and $C_{ij} = 0$ otherwise. Let
$x = (x_1\ x_2\ \cdots\ x_m)^T$. Then $q(x) = 0$ is the same as $\langle C, x \otimes x \rangle = x^T C x = 0$, where $\langle \cdot, \cdot \rangle$ denotes
the entry-wise dot product of two matrices. Note that $x \otimes x$ is a symmetric rank-1 matrix.

Suppose now that we have a symmetric matrix $A$ such that $\langle C, A \rangle = 0$. For a fixed
$C$, this is a linear constraint on the entries of $A$. If in addition $A$ has rank 1, then there
exists $x_A$ such that $A = x_A \otimes x_A$, and by the above, we have that $x_A$ satisfies $q(x_A) = 0$.
Therefore, if $A$ is a symmetric rank 1 matrix and $\langle C, A \rangle = 0$, then $A$ encodes an assignment
that satisfies the quadratic equation $q(x) = 0$.

The following decomposition lemma from [29] illustrates the situation when $A$ has low
rank. 


Lemma 2.4. Let $A \in \mathbb{F}_2^{m \times m}$ be a symmetric matrix of rank $k$ over $\mathbb{F}_2$. Then there exists
$l \le 3k/2$ and vectors $v_1, \dots, v_l$ in the column space of $A$, such that $A = \sum_{i=1}^{l} v_i \otimes v_i$.

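Let $A$ be a low rank matrix such that $\langle C, A \rangle = 0$ and $v_1, \dots, v_l$ be $l \le 3k/2$ assignments
given by Lemma 2.4. Then

$$\begin{aligned}
0 = \langle C, A \rangle &= \sum_{t=1}^{l} \langle C, v_t \otimes v_t \rangle \\
&= \sum_{t=1}^{l} \sum_{1 \le i < j \le m} c_{ij} v_{t,i} v_{t,j} \\
&= \sum_{1 \le i < j \le m} c_{ij} \sum_{t=1}^{l} v_{t,i} v_{t,j} .
\end{aligned}$$

Therefore we have that $v_1, \dots, v_l$ satisfy $q(x) = 0$ in superposition.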

The notion we will now consider is the following, which we call odd-covering. 
Definition 2.5 (Odd-covering). Let $a^{(1)}, \dots, a^{(t)} \in \mathbb{F}_2^m$ be $t$ assignments and $q(x) = 0$ be
a quadratic equation in $m$ $\mathbb{F}_2$-variables as defined above. We say that the $t$ assignments
odd-cover the equation $q(x) = 0$ if the number of assignments $a^{(l)}$ that satisfy $q(a^{(l)}) = 0$
is odd.


The key observation is that odd-covering and satisfying in superposition are equivalent 
when the number of assignments involved is odd. 

Lemma 2.6. Let $t$ be an odd integer and $a^{(1)}, \dots, a^{(t)} \in \mathbb{F}_2^m$ be $t$ assignments, and $q(x) = 0$
be a quadratic equation in $m$ $\mathbb{F}_2$-variables as defined above. Then the $t$ assignments satisfy
$q(x) = 0$ in superposition if and only if the $t$ assignments odd-cover $q(x) = 0$.

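Proof. Using the fact that $t$ is odd, we have the following

$$\begin{aligned}
\sum_{l=1}^{t} q(a^{(l)}) &= \sum_{l=1}^{t} \left( c + \sum_{i=1}^{m} c_i a_i^{(l)} + \sum_{1 \le i < j \le m} c_{ij} a_i^{(l)} a_j^{(l)} \right) \\
&= t \cdot c + \sum_{l=1}^{t} \sum_{i=1}^{m} c_i a_i^{(l)} + \sum_{l=1}^{t} \sum_{1 \le i < j \le m} c_{ij} a_i^{(l)} a_j^{(l)} \\
&= c + \sum_{i=1}^{m} c_i \left( \sum_{l=1}^{t} a_i^{(l)} \right) + \sum_{1 \le i < j \le m} c_{ij} \left( \sum_{l=1}^{t} a_i^{(l)} a_j^{(l)} \right) .
\end{aligned}$$

Now observe that the $t$ assignments odd-cover $q(x) = 0$ if and only if the number of
assignments that do not satisfy $q(x) = 0$ is even, which is equivalent to saying that
the left hand side of the above equation is 0, and that by definition means that the $t$
assignments satisfy $q(x) = 0$ in superposition. □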
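
The equivalence can also be checked numerically. The following Python sketch, with helper names and the coefficient representation chosen by us for illustration, evaluates both notions directly from their definitions and compares them on random instances with an odd number of assignments.

```python
import random
from itertools import combinations

def eval_q(c, lin, quad, a):
    """Value of q(a) over F_2; `quad` maps index pairs (i, j), i < j, to c_ij."""
    val = c
    for i, ci in enumerate(lin):
        val ^= ci & a[i]
    for (i, j), cij in quad.items():
        val ^= cij & a[i] & a[j]
    return val

def satisfied_in_superposition(c, lin, quad, assignments):
    """Plug the sums over assignments into the linear and quadratic terms separately."""
    val = c
    for i, ci in enumerate(lin):
        val ^= ci & (sum(a[i] for a in assignments) % 2)
    for (i, j), cij in quad.items():
        val ^= cij & (sum(a[i] & a[j] for a in assignments) % 2)
    return val == 0

def odd_covers(c, lin, quad, assignments):
    """True iff an odd number of the assignments satisfy q(x) = 0."""
    return sum(eval_q(c, lin, quad, a) == 0 for a in assignments) % 2 == 1

def equivalence_holds(m=5, t=3, trials=200, seed=0):
    """Randomized check of the equivalence for a given number t of assignments."""
    rng = random.Random(seed)
    for _ in range(trials):
        c = rng.randint(0, 1)
        lin = [rng.randint(0, 1) for _ in range(m)]
        quad = {p: rng.randint(0, 1) for p in combinations(range(m), 2)}
        assignments = [[rng.randint(0, 1) for _ in range(m)] for _ in range(t)]
        if satisfied_in_superposition(c, lin, quad, assignments) != odd_covers(c, lin, quad, assignments):
            return False
    return True
```

For an even number of assignments the two notions can differ: for the equation $x_1 = 0$ and the two assignments $(1)$ and $(0)$, exactly one assignment satisfies the equation, but the sum of the two values of $x_1$ is 1, so the equation is not satisfied in superposition.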


In the description above, we assumed that the quadratic equation $q(x) = 0$ is
homogeneous, which allows us to encode it with a matrix $C \in \mathbb{F}_2^{m \times m}$ and express the
whole equation as $\langle C, A \rangle = 0$, where $A = x \otimes x$. For quadratic equations that are
not homogeneous, we encode them with an $(m+1) \times (m+1)$ matrix. In particular, for
$q(x) = c + \sum_i c_i x_i + \sum_{i<j} c_{ij} x_i x_j = 0$, we have matrix $C$, where $C_{1,1} = c$, $C_{1,i} = c_{i-1}$ for
$i = 2, \dots, m+1$, and $C_{i,j} = c_{i-1,j-1}$ for $2 \le i < j \le m+1$. As for the variable vector, we
add to $x$ an entry that is always 1.

Definition 2.7. Given a matrix $A \in \mathbb{F}_2^{(m+1) \times (m+1)}$, we say that $A$ is pseudo-quadratic
if the following holds:

• $A$ is symmetric.

• $A_{1,1} = 1$.

• For all $i = 2, \dots, m+1$, $A_{1,i} = A_{i,i}$.

Note that for a vector $v \in \mathbb{F}_2^{m+1}$ such that $v_1 = 1$, $v \otimes v$ is a pseudo-quadratic rank-1
matrix.
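
The matrix encoding of a non-homogeneous equation can be sanity-checked with a short Python sketch. All names are our own, indices are 0-based in the code (so the distinguished first coordinate is index 0), and the third pseudo-quadratic condition is assumed to read $A_{1,i} = A_{i,i}$.

```python
from itertools import product

def outer(v):
    """Outer product v (x) v over F_2."""
    return [[vi & vj for vj in v] for vi in v]

def inner(C, A):
    """Entry-wise dot product <C, A> over F_2."""
    return sum(c & a for rc, ra in zip(C, A) for c, a in zip(rc, ra)) % 2

def matrix_form(c, lin, quad):
    """(m+1) x (m+1) matrix of q(x) = c + sum c_i x_i + sum c_ij x_i x_j."""
    m = len(lin)
    C = [[0] * (m + 1) for _ in range(m + 1)]
    C[0][0] = c                       # constant term
    for i, ci in enumerate(lin):
        C[0][i + 1] = ci              # linear terms sit in the first row
    for (i, j), cij in quad.items():
        C[i + 1][j + 1] = cij         # quadratic terms, shifted by one
    return C

def eval_q(c, lin, quad, x):
    val = c
    for i, ci in enumerate(lin):
        val ^= ci & x[i]
    for (i, j), cij in quad.items():
        val ^= cij & x[i] & x[j]
    return val

def is_pseudo_quadratic(A):
    """Symmetric, top-left entry 1, and first row agrees with the diagonal (assumed form)."""
    n = len(A)
    symmetric = all(A[i][j] == A[j][i] for i in range(n) for j in range(n))
    return symmetric and A[0][0] == 1 and all(A[0][i] == A[i][i] for i in range(1, n))

# Hypothetical example: q(x) = 1 + x_1 + x_2 x_3 on m = 3 variables.
Q_EXAMPLE = (1, [1, 0, 0], {(1, 2): 1})
C_EXAMPLE = matrix_form(*Q_EXAMPLE)
```

For every $x$, the matrix $A = (1, x) \otimes (1, x)$ is pseudo-quadratic and $\langle C, A \rangle$ equals $q(x)$.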

We prove a stronger form of Lemma 2.4 for pseudo-quadratic matrices where we decode
a low rank pseudo-quadratic matrix into an odd number of assignments. 

Lemma 2.8. Let $A \in \mathbb{F}_2^{(m+1) \times (m+1)}$ be a pseudo-quadratic matrix of rank $k$ over $\mathbb{F}_2$. Then
there exists an odd integer $k_0 \le 3k/2 + 1$, and vectors $v_1, \dots, v_{k_0} \in \mathbb{F}_2^{m+1}$, such that for all
$i \in [k_0]$, $v_{i,1} = 1$, and $A = \sum_{i=1}^{k_0} v_i \otimes v_i$. Moreover, for all $i \in [k_0]$, $D_1(v_i)$ is in the column
space of $D_1(A)$.
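Proof. Let $A' = D_1(A)$. Note that $A'$ is symmetric and has rank at most $k$. Therefore by
Lemma 2.4 there exist $l \le 3k/2$ vectors $u_1, \dots, u_l \in \mathbb{F}_2^m$, such that $A' = \sum_{i=1}^{l} u_i \otimes u_i$.
Now consider vectors $v_1, \dots, v_l \in \mathbb{F}_2^{m+1}$ where for each $i$, $v_{i,1} = 1$ and $v_{i,j} = u_{i,j-1}$ for
$j = 2, \dots, m+1$. Let $A'' = \sum_{i=1}^{l} v_i \otimes v_i$ and $B = A - A''$. For $j, j' \in \{2, \dots, m+1\}$, we
have

$$A''_{j,j'} = \sum_{i=1}^{l} v_{i,j} v_{i,j'} = \sum_{i=1}^{l} u_{i,j-1} u_{i,j'-1} = A'_{j-1,j'-1} = A_{j,j'} .$$

Moreover, we have

$$A''_{1,j} = \sum_{i=1}^{l} v_{i,1} v_{i,j} = \sum_{i=1}^{l} v_{i,j} v_{i,j} = A''_{j,j} = A_{j,j} = A_{1,j} .$$

We conclude that for all $(i,j) \ne (1,1)$, $A_{i,j} = A''_{i,j}$. Note that $A''_{1,1} = (l \bmod 2)$. Therefore
if $A''_{1,1} = 1 = A_{1,1}$, then we have that $l$ is odd and $A = \sum_{i=1}^{l} v_i \otimes v_i$ as promised. Otherwise
$l$ is even. Let $e = (1\ 0\ \cdots\ 0) \in \mathbb{F}_2^{m+1}$. Then $A = \sum_{i=1}^{l} v_i \otimes v_i + e \otimes e$ gives the desired
decomposition. □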

The following lemma summarizes the discussion at the beginning of this section and 
relates odd superposition complexity with low-rank pseudo-quadratic matrices. 

Lemma 2.9. Let $q_1(x) = 0, \dots, q_s(x) = 0$ be a set of $s$ quadratic equations on variables
$x_1, \dots, x_m$, and let $Q_1, \dots, Q_s \in \mathbb{F}_2^{(m+1) \times (m+1)}$ be the corresponding matrix forms.
Suppose there is a pseudo-quadratic matrix $A \in \mathbb{F}_2^{(m+1) \times (m+1)}$ such that $\mathrm{rank}(A) \le k$
and for all $i \in [s]$, $\langle Q_i, A \rangle = 0$, then there exists an integer $l \le 3k/2 + 1$ and $l$ vectors
$a^{(1)}, \dots, a^{(l)} \in \mathbb{F}_2^{m+1}$, such that $A = \sum_{j=1}^{l} a^{(j)} \otimes a^{(j)}$, where for all $j \in [l]$, we have $a^{(j)}_1 = 1$
and that $D_1(a^{(j)})$ is in the column space of $D_1(A)$. This implies that the assignments
$D_1(a^{(1)}), \dots, D_1(a^{(l)})$ satisfy all equations $q_1(x) = 0, \dots, q_s(x) = 0$ in superposition.

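Proof. Apply Lemma 2.8 to $A$, and let $v_1, \dots, v_l$ be the vectors we get, with $l$ odd, $v_{i,1} = 1$ for
$i \in [l]$, and $A = \sum_{i=1}^{l} v_i \otimes v_i$. We now verify that $D_1(v_1), \dots, D_1(v_l)$ satisfy all equations
in superposition.

Consider equation $i$ for $i \in [s]$. We have

$$0 = \langle Q_i, A \rangle = \sum_{j=1}^{l} \langle Q_i, v_j \otimes v_j \rangle = \sum_{j=1}^{l} q_i(D_1(v_j)) ,$$

where the last step uses $v_{j,1} = 1$ and the definition of the matrix form $Q_i$. Since $l$ is odd,
the computation in the proof of Lemma 2.6 shows that the right hand side equals the
expression in Definition 2.2. By definition, we have that $D_1(v_1), \dots, D_1(v_l)$ satisfy $q_i$ in
superposition. □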

2.2 Label-Cover 

The starting point of our reduction is the Label-Cover hardness obtained from E3-Sat
instances. We use Label-Cover instances obtained by applying the PCP Theorem
and the Parallel Repetition Theorem [34]. The exact formulation below is from [17].

Definition 2.10. Let $\phi$ be a E3-Sat instance with $X$ as the set of variables and $C$ the set
of clauses. The $r$-repeated Label-Cover instance $\mathcal{L}(r, \phi)$ is specified by:

• A bipartite graph $G = (U, V, E)$, where $V := C^r$ and $U := X^r$.

• Label set for $U$, denoted by $L := \{0,1\}^r$, and label set for $V$, denoted by $R := \{0,1\}^{3r}$.

• There is an edge $\{u, v\} \in E$ if for each $i \in [r]$, $u_i$ is a variable appearing in clause
$v_i$.

• For edge $\{u, v\}$, the constraint $\pi_{uv} : \{0,1\}^{3r} \to \{0,1\}^r$ is the projection of the
assignment of the $3r$ clause variables in $v$ to the assignment of the $r$ variables in $u$.

• For each $v \in V$, there is a set of $r$ functions $\{f_i^v : \{0,1\}^{3r} \to \{0,1\}\}_{i \in [r]}$, such that
$f_i^v(a) = 0$ if and only if the assignment $a$ satisfies the clause $v_i$. Note that each $f_i^v$
depends only on 3 entries of $a$.

A labeling $\sigma : U \to L, V \to R$ satisfies an edge $\{u, v\}$ iff $\pi_{uv}(\sigma(v)) = \sigma(u)$, and $\sigma(v)$
satisfies all clauses in $v$. The value of $\mathcal{L}(r, \phi)$ is the maximum fraction of edges that can
be simultaneously satisfied by any labeling.
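
The following Python sketch builds the $r$-repeated instance for a toy formula and checks that a satisfying assignment induces a labeling of value 1. The clause format (DIMACS-style signed integers), the helper names and the toy formula are our own assumptions, and clause satisfaction is represented by booleans instead of the functions $f_i^v$.

```python
from itertools import product

def repeated_label_cover(clauses, r):
    """Vertex sets U = X^r, V = C^r (clause indices) and the edge list."""
    variables = sorted({abs(lit) for cl in clauses for lit in cl})
    U = list(product(variables, repeat=r))
    V = list(product(range(len(clauses)), repeat=r))
    edges = []
    for v in V:
        # u_i ranges over the variables appearing in clause v_i
        choices = [[abs(lit) for lit in clauses[ci]] for ci in v]
        edges.extend((u, v) for u in product(*choices))
    return U, V, edges

def project(clauses, u, v, label_v):
    """pi_uv: restrict the 3r clause bits of v to the r variables of u."""
    out = []
    for i, (var, ci) in enumerate(zip(u, v)):
        pos = [abs(lit) for lit in clauses[ci]].index(var)
        out.append(label_v[3 * i + pos])
    return tuple(out)

def clause_satisfied(clause, bits):
    return any((b == 1) == (lit > 0) for lit, b in zip(clause, bits))

def satisfies_edge(clauses, u, v, label_u, label_v):
    clauses_ok = all(clause_satisfied(clauses[ci], label_v[3 * i:3 * i + 3])
                     for i, ci in enumerate(v))
    return clauses_ok and project(clauses, u, v, label_v) == tuple(label_u)

def labeling_from_assignment(clauses, assignment, u=None, v=None):
    """Honest labels induced by a global assignment (dict: variable -> bit)."""
    if u is not None:
        return tuple(assignment[var] for var in u)
    return tuple(assignment[abs(lit)] for ci in v for lit in clauses[ci])

# Hypothetical toy formula: (x1 or x2 or x3) and (not x1 or x2 or x4).
CLAUSES = [(1, 2, 3), (-1, 2, 4)]
U, V, EDGES = repeated_label_cover(CLAUSES, 2)
```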

We have the following hardness result for Label-Cover. 

Theorem 2.11. Given a E3-Sat instance $\phi$ on $n$ variables and $r \in \mathbb{N}$, there is an
algorithm that constructs $\mathcal{L}(r, \phi)$ in time $n^{O(r)}$, and the output Label-Cover instance
has the following properties:

• If $\phi$ is satisfiable, then the value of $\mathcal{L}(r, \phi)$ is 1.

• If $\phi$ is unsatisfiable, then the value of $\mathcal{L}(r, \phi)$ is at most $2^{-\varepsilon_0 r}$ for some universal
constant $\varepsilon_0 \in (0,1)$.

In our construction of the Label-Cover instance with matrix labels, we need to use the
following Parallel Repetition theorem from Rao [33], which applies to projection games
(Label-Cover), with the advantage that the rate at which the soundness decreases is
independent of the label size of the original instance.

Theorem 2.12 (Parallel Repetition [33]). There is a universal constant $\alpha > 0$, such that
for a Label-Cover instance $\Psi$, if $\mathrm{Opt}(\Psi) \le 1 - \varepsilon$, then $\mathrm{Opt}(\Psi^k) \le (1 - \varepsilon/2)^{\alpha \varepsilon k}$.

2.3 Low-Degree-Long-Code 

In this section, we review the basics of Low-Degree-Long-Code. The formulation here
is from [12] and [17]. Towards the end of this section, we prove a key lemma that we will
use for proving our superposition hardness results.

For a positive integer $m$, denote by $P_m$ the vector space of $m$-variable functions $\mathbb{F}_2^m \to \mathbb{F}_2$.
For $f, g \in P_m$, let $\Delta(f, g)$ be the Hamming distance between $f$ and $g$. For a subset of
functions $F \subseteq P_m$, the distance between $g$ and $F$ is defined as $\Delta(g, F) = \min_{f \in F} \Delta(f, g)$.
We define the following dot product on $P_m$.

Definition 2.13 (Dot Product). For $f, g \in P_m$, the dot product is defined as
$\langle f, g \rangle = \sum_{x \in \mathbb{F}_2^m} f(x) g(x)$.

Denote by $P_{m,d}$ the space of functions with degree at most $d$. For a subspace
$A \subseteq P_{m,d}$, denote its dual by $A^\perp = \{ g \in P_m \mid \forall f \in A,\ \langle f, g \rangle = 0 \}$. It is well known that
$P_{m,d}^\perp = P_{m,m-d-1}$.

For $\beta \in P_m$, denote by $\mathrm{supp}(\beta)$ the support of $\beta$, that is $\mathrm{supp}(\beta) = \{ x \mid \beta(x) = 1 \}$.
Define $\mathrm{wt}(\beta) = |\mathrm{supp}(\beta)|$.
Definition 2.14 (Low-Degree-Long-Code). The Low-Degree-Long-Code encoding
for an $m$-bit string $a \in \mathbb{F}_2^m$ is a function $A_a : P_{m,d} \to \mathbb{F}_2$, defined as $A_a(g) = g(a)$, for
all $g \in P_{m,d}$.
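
As an illustration of the definition, the following Python sketch writes down the full encoding table for very small $m$ and $d$. Polynomials are represented as sets of monomials, and all names are our own; the enumeration is exponential in the number of monomials and is meant for toy parameters only.

```python
from itertools import combinations

def monomials(m, d):
    """All multilinear monomials of degree at most d, as frozensets of variable indices."""
    return [frozenset(s) for k in range(d + 1) for s in combinations(range(m), k)]

def eval_poly(poly, a):
    """Evaluate a polynomial (a set of monomials) at the point a over F_2."""
    return sum(all(a[i] for i in mono) for mono in poly) % 2

def all_polys(m, d):
    """Enumerate P_{m,d}: every subset of the monomials of degree at most d."""
    mons = monomials(m, d)
    for mask in range(1 << len(mons)):
        yield frozenset(mon for k, mon in enumerate(mons) if (mask >> k) & 1)

def ldlc_encode(a, d):
    """Table A_a with A_a(g) = g(a) for every g in P_{m,d}."""
    return {g: eval_poly(g, a) for g in all_polys(len(a), d)}

TABLE = ldlc_encode((1, 0, 1), 1)
```

Adding two polynomials corresponds to the symmetric difference of their monomial sets, so the table is linear: `TABLE[g ^ h] == TABLE[g] ^ TABLE[h]`.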

Definition 2.15 (Character Set). For $\beta \in P_m$, define the corresponding character function
$\chi_\beta : P_{m,d} \to \mathbb{R}$ as $\chi_\beta(f) = (-1)^{\langle \beta, f \rangle}$.

Define the character set $\Lambda_{m,d}$ to be the set of functions $\beta \in P_m$ which are minimum
weight functions in the cosets of $P_m / P_{m,d}^\perp$, where ties are broken arbitrarily.

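We have the following result about the character set and the "Fourier decomposition"
for functions on $P_{m,d}$ from [12].

Lemma 2.16. The following statements hold:

• For any $\beta, \beta' \in P_m$, $\chi_\beta = \chi_{\beta'}$ if and only if $\beta - \beta' \in P_{m,d}^\perp$.

• For $\beta \in P_{m,d}^\perp$, $\chi_\beta$ is the constant 1 function.

• For any $\beta$, there exists $\beta'$ such that $\beta - \beta' \in P_{m,d}^\perp$, and $|\mathrm{supp}(\beta')| = \Delta(\beta, P_{m,d}^\perp)$. We
call such $\beta'$ the minimum support function for the coset $\beta + P_{m,d}^\perp$.

• The characters in the character set $\Lambda_{m,d}$ form an orthonormal basis under the inner
product $\langle A, B \rangle = \mathbb{E}_{f \in P_{m,d}}[A(f) B(f)]$.

• Any function $A : P_{m,d} \to \mathbb{R}$ can be uniquely decomposed as

$$A(g) = \sum_{\beta \in \Lambda_{m,d}} \hat{A}_\beta \chi_\beta(g) .$$

• Parseval's identity: For any $A : P_{m,d} \to \mathbb{R}$, $\sum_{\beta \in \Lambda_{m,d}} \hat{A}_\beta^2 = \mathbb{E}_{f \in P_{m,d}}[A(f)^2]$.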

The following lemma relates characters from different domains related by coordinate 
projections and is from [El- 

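Lemma 2.17. Let $n \le m$, and $S \subseteq [m]$ with $|S| = n$, and let $\pi : \mathbb{F}_2^m \to \mathbb{F}_2^n$ be a projection,
mapping $x \in \mathbb{F}_2^m$ to $x|_S \in \mathbb{F}_2^n$. Then for $f \in P_{n,d}$ and $\beta \in P_m$, we have

$$\chi_\beta(f \circ \pi) = \chi_{\pi_2(\beta)}(f) ,$$

where $\pi_2(\beta)(y) = \sum_{x \in \pi^{-1}(y)} \beta(x)$.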

Like in the classical Long-Code reductions, we enforce special structures on the tables. 
This is a technique known as folding. The following properties of the Fourier coefficients 
of folded functions were also studied in [12]. 

Definition 2.18. A table $A : P_{m,d} \to \mathbb{R}$ is folded if for any $f \in P_{m,d}$, we have
$A(f + 1) = -A(f)$.

Lemma 2.19. If $A : P_{m,d} \to \mathbb{R}$ is folded, then for any $\alpha$ such that $\hat{A}_\alpha \ne 0$, we have
$\sum_x \alpha(x) = 1$. In particular, we have $\mathrm{supp}(\alpha) \ne \emptyset$.

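Definition 2.20. Let $q_1, \dots, q_k \in P_{m,3}$, and let

$$J(q_1, \dots, q_k) := \left\{ \sum_{i} r_i q_i \;\middle|\; r_i \in P_{m,d-3} \right\} .$$

We say that a function $A : P_{m,d} \to \mathbb{R}$ is folded over $J$ if $A$ is constant over cosets of $J$ in
$P_{m,d}$.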
The following lemma shows that a function folded over $J$ does not have weight on small
support characters that are non-zero on $J$.

Lemma 2.21. Let $\beta \in P_m$ be such that $\mathrm{wt}(\beta) < 2^{d-3}$, and there exists some $i \in [k]$ and
$x \in \mathrm{supp}(\beta)$ with $q_i(x) \ne 0$. Then if $A : P_{m,d} \to \mathbb{R}$ is folded over $J$, then $\hat{A}_\beta = 0$.

In the actual reduction, $q_1, \dots, q_k$ will be the set of functions associated with vertices
in the Label-Cover instance, as described in Definition 2.10.

In [12], Dinur and Guruswami proved the following theorem about Reed-Muller codes
over $\mathbb{F}_2$.

Theorem 2.22. Let d be a multiple of 4. If fd (z Pm is such that A(/3, Pm,d) ^ then 


E 

5~Pm.d/4 


E ixpigh)] 




Note that $\chi_\beta(gh) = (-1)^{\langle \beta g, h \rangle}$. The key lemma we will now prove is a
generalization of the above theorem. The setting is that we have an additional $t$ functions
$A_1, \dots, A_t : P_{m,d/2} \to \mathbb{F}_2$. We show that as long as $t$ is small compared to $2^{d/2}$, the
expectation is still close to 0 for arbitrary $A_1, \dots, A_t$.

We use some of the key steps in [12].

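Definition 2.23. For $\beta \in P_m$ and $k \le d$, define

$$B^{(m)}_{d,k}(\beta) := \left\{ q \in P_{m,k} \mid q\beta \in P_{m,m-d-1+k} \right\} .$$

Note that $B^{(m)}_{d,k}(\beta)$ is a subspace of $P_{m,k}$.

For positive integers $d, k$, define $\Phi_{d,k} : \mathbb{N} \to \mathbb{N}$ as follows: if $d < k$, then $\Phi_{d,k}$ is
identically 0, otherwise

$$\Phi_{d,k}(D) = \min_{\beta \in P_m :\ \Delta(\beta, P_{m,m-d-1}) > D} \left\{ \dim(P_{m,k}) - \dim\big(B^{(m)}_{d,k}(\beta)\big) \right\} .$$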


The following two claims are from [12], which serve as the basis step and induction
step for their lower-bound for $\Phi_{d,k}(D)$.

Claim 2.24. For $d \ge k$ and $D \ge 1$, $\Phi_{d,k}(D) \ge 1$.

Claim 2.25. For all $d \ge k$ and $4 \le D \le 2^d$, $\Phi_{d,k}(D) \ge \Phi_{d-1,k}(D/4) + \Phi_{d-1,k-1}(D/4)$.

For $D = 2^{d-4} = 4^{d/2-2}$, applying the above for a depth of $d/2 - 4$, reducing
$D$ from $4^{d/2-2}$ to $4^2$, we have $\Phi_{d,d/2}(2^{d-4}) \ge 2^{d/2-4}$. This gives the following theorem.

Theorem 2.26. For all integers $m, d$ such that $m > d > 0$ and $4 \mid d$, if $\beta : \mathbb{F}_2^m \to \mathbb{F}_2$ has
distance more than $2^{d-4}$ from $P_{m,m-d-1}$, then the subspace $B^{(m)}_{d,d/2}(\beta)$ (as a subspace of
$P_{m,d/2}$) has codimension at least $2^{d/2-4}$.

We remark that Dinur and Guruswami used different degree parameters in [12] for
their application. Otherwise, the above theorem is the same as in [12].

We are now ready to prove the main theorem of this section. 
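Theorem 2.27. Let $\beta : \mathbb{F}_2^m \to \mathbb{F}_2$ be a polynomial with distance more than $2^{d-4}$ from
$P_{m,m-d-1}$. Let $t \in \mathbb{N}$ and $A_1, \dots, A_t : P_{m,d/2} \to \mathbb{F}_2$ be some arbitrary $t$ functions. Let $\mu$
be the uniform distribution on $P_{m,d/2}$. Then

$$\left| \mathbb{E}_{g,h \sim \mu} \left[ (-1)^{\langle \beta g, h \rangle + \sum_{i=1}^{t} A_i(g) A_i(h)} \right] \right| \le 2^{-(2^{d/2-4} - t)/2} .$$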


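Proof. Denote by $W$ the quotient space $P_{m,d/2} / B^{(m)}_{d,d/2}(\beta)$. By Theorem 2.26 we have
$w := \dim(W) \ge 2^{d/2-4}$.

The expectation we are considering can be written as

$$\mathbb{E}_{g_0, h_0 \sim W} \ \mathbb{E}_{\substack{g :\ g - g_0 \in B^{(m)}_{d,d/2}(\beta) \\ h :\ h - h_0 \in B^{(m)}_{d,d/2}(\beta)}} \left[ (-1)^{\langle \beta g, h \rangle + \sum_{i=1}^{t} A_i(g) A_i(h)} \right] . \qquad (1)$$

Consider $f \in P_{m,d/2}$ and $g \in B^{(m)}_{d,d/2}(\beta)$. We have $\langle \beta f, \beta g \rangle = \langle \beta g, f \rangle = 0$, because
$f \in P_{m,d/2}$ and $\beta g \in P_{m,m-d/2-1} = P_{m,d/2}^\perp$. This allows us to define a "dot product" between
elements in $W$. In particular, for any $f, f', g, g' \in P_{m,d/2}$ such that $f - f', g - g' \in B^{(m)}_{d,d/2}(\beta)$,
we have

$$\begin{aligned}
\langle \beta f', \beta g' \rangle &= \langle \beta f, \beta g \rangle + \langle \beta (f - f'), \beta g \rangle + \langle \beta f, \beta (g - g') \rangle + \langle \beta (f - f'), \beta (g - g') \rangle \\
&= \langle \beta f, \beta g \rangle .
\end{aligned}$$

This means that taking any representative from $W$ will give the same result for this "dot
product".

We can thus further rewrite the expectation as

$$\mathbb{E}_{g_0, h_0 \sim W} \left[ (-1)^{\langle \beta g_0, \beta h_0 \rangle} \ \mathbb{E}_{\substack{g :\ g - g_0 \in B^{(m)}_{d,d/2}(\beta) \\ h :\ h - h_0 \in B^{(m)}_{d,d/2}(\beta)}} \left[ (-1)^{\sum_{i=1}^{t} A_i(g) A_i(h)} \right] \right] . \qquad (2)$$

Consider the matrix $M \in \mathbb{R}^{2^{w+t} \times 2^{w+t}}$, where the rows and columns are indexed by a
pair $(f_0, a)$ where $f_0 \in W$ and $a \in \mathbb{F}_2^t$, and the entries are

$$M_{(f_0, a), (g_0, b)} = (-1)^{\langle \beta f_0, \beta g_0 \rangle + \langle a, b \rangle} .$$

Define the vector $u \in \mathbb{R}^{2^{w+t}}$ as

$$u_{f_0, a} = \Pr_{g \sim P_{m,d/2}} \left[ g - f_0 \in B^{(m)}_{d,d/2}(\beta) \ \wedge\ \forall i \in [t],\ A_i(g) = a_i \right] .$$

Since in (1), $g$ and $h$ are sampled independently, we can verify that the expectation in (2) is
exactly $u^T M u$. Moreover, since $g$ is chosen uniformly at random from $P_{m,d/2}$, the probability
that $g - f_0 \in B^{(m)}_{d,d/2}(\beta)$ is exactly $2^{-w}$, thus all entries in $u$ have value at most $2^{-w}$, and
therefore $\|u\|_2 \le 2^{-w/2}$.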
We finish the proof by studying the spectrum of $M$. Observe that $M$ can be written as
the tensor product of a $2^w \times 2^w$ matrix and a $2^t \times 2^t$ matrix as follows. Define the matrix
$W \in \mathbb{R}^{2^w \times 2^w}$ as

$$W_{f_0, g_0} = (-1)^{\langle \beta f_0, \beta g_0 \rangle}$$

for $f_0, g_0 \in W$. Define $H \in \mathbb{R}^{2^t \times 2^t}$ as

$$H_{a,b} = (-1)^{\langle a, b \rangle}$$

for $a, b \in \mathbb{F}_2^t$. We can easily verify that $M = W \otimes H$.

The matrix $H$ satisfies $H H^T = 2^t \cdot I$, where $I$ is the identity matrix, therefore we have
that the eigenvalues of $H$ all have absolute value exactly $2^{t/2}$. For the spectrum of $W$, let
$f_0, g_0 \in W$ be two rows of $W$. Consider the dot product of row $f_0$ and $g_0$ of matrix $W$

$$W_{f_0} W_{g_0}^T = \sum_{h_0 \in W} (-1)^{\langle \beta (f_0 + g_0), \beta h_0 \rangle} = \sum_{h_0 \in W} (-1)^{\langle \beta (f_0 + g_0), h_0 \rangle} .$$

The above sum is $2^w$ if $\beta (f_0 + g_0) \in P_{m,m-d/2-1}$, or in other words $f_0$ and $g_0$ belong to
the same coset in $W$, and otherwise the sum is 0. Hence we have $W W^T = 2^w \cdot I$, and thus
the eigenvalues of $W$ all have absolute value $2^{w/2}$. We conclude that the tensor product
matrix $M = W \otimes H$ has eigenvalues with absolute value $2^{(w+t)/2}$.

We can now upper-bound the absolute value of the expectation by
$|u^T M u| \le 2^{(w+t)/2} \cdot \|u\|_2^2 \le 2^{-(w-t)/2} \le 2^{-(2^{d/2-4} - t)/2}$. □
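
The spectral facts about the $\pm 1$ matrix with entries $(-1)^{\langle a, b \rangle}$ used in the proof can be checked for small $t$ with the following Python sketch. The function names are our own, and the check covers only this factor of the tensor product, not the full matrix $M$.

```python
def hadamard(t):
    """The 2^t x 2^t matrix with entries (-1)^<a,b>, rows and columns indexed by bitmasks."""
    n = 1 << t
    return [[-1 if bin(a & b).count("1") % 2 else 1 for b in range(n)] for a in range(n)]

def gram(M):
    """The product M M^T."""
    n = len(M)
    return [[sum(M[i][k] * M[j][k] for k in range(n)) for j in range(n)] for i in range(n)]

def quadratic_form(M, u):
    """The value u^T M u."""
    n = len(M)
    return sum(u[i] * M[i][j] * u[j] for i in range(n) for j in range(n))

T = 3
H = hadamard(T)
UNIFORM = [1.0 / (1 << T)] * (1 << T)
```

Since $H H^T = 2^t \cdot I$ and $H$ is symmetric, every eigenvalue has absolute value $2^{t/2}$, so $|u^T H u| \le 2^{t/2} \|u\|_2^2$ for every vector $u$.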

3 Superposition Hardness for Gap TSA 

Let b be some large integer parameter. The Tri-Sum-And (TSA) predicate is a predicate 
on 5 $\mathbb{F}_2$-variables defined as follows

$$\mathrm{TSA}(x_1, \dots, x_5) = 1 + x_1 + x_2 + x_3 + x_4 x_5 .$$

From the definition, we can see that TSA instances are systems of quadratic equations,
each involving exactly 5 $\mathbb{F}_2$-variables.

The predicate was studied in [20] as a starting point of an efficient PCP construction.
For the predicate itself, Hastad and Khot proved that it is approximation resistant on 
satisfiable instances. 

In this section, we prove a superposition hardness result for TSA. 

Theorem 3.1. There is a reduction that takes as input a E3-Sat instance of size n, and 
outputs a TSA instance of size following properties: 

• If the E3-Sat instance is satisfiable, then there is an assignment that satisfies all 
TSA constraints. 

• If the E3-Sat instance is unsatisfiable, then for any odd integer $t \le (\log n)^b$, and
any t assignments, at most a 15/16 fraction of the TSA constraints are satisfied in 
superposition. 

The reduction runs in time ^ 

Proof. The reduction follows a similar approach as a typical inapproximability hardness 
reduction. 
Given an E3-Sat instance, we apply Theorem 2.11 with soundness $1/(1000(\log n)^{2b})$ to
get a Label-Cover instance. This gives the parameter $r = (2b \log\log n + O(1))/\epsilon_0$, where
$\epsilon_0$ is some universal constant. The vertex set of the bipartite graph has size
and the label sets are $L = \{0,1\}^{r}$ and $R = \{0,1\}^{3r}$. Let $d = O(b \log\log n)$ be such that
$2^{d/2-4} \ge (\log n)^b + 3$. This implies also that $2^{d} \ge 256(\log n)^{2b}$.

For each $u \in U$ and $v \in V$, we expect functions $f_u : P_{r,d} \to \{-1, 1\}$ and $g_v : P_{3r,d} \to
\{-1, 1\}$. We assume that all functions are folded over constant. The entries of the functions
correspond to variables of some TSA instance. Therefore the number of variables in the 
output instance is and the number of constraints 

is polynomial in the number of variables. 

Consider the following test: 

1. Sample a random edge $e = \{u_1, u_2\} \sim E$. Let $\pi$ be the projection on the edge, and let
f and g be the functions associated with $u_1$ and $u_2$.

2. Sample uniformly random queries $x \sim P_{r,d}$, $y \sim P_{3r,d}$ and $v, w \sim P_{3r,d/2}$.

3. Construct the query $z := x \circ \pi + y + vw \in P_{3r,d}$.

4. Accept iff $f(x)g(y)g(z)(g(v) \wedge g(w)) = 1$, where $\wedge$ here denotes the binary operator
that evaluates to −1 when both operands are −1, and 1 otherwise.
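The acceptance condition in step 4 is the TSA predicate written in ±1 notation. The sketch below checks this exhaustively under the usual map $b \mapsto (-1)^b$; the helper names are hypothetical and the check only concerns the five queried values, not the sampling of the queries.

```python
from itertools import product

def and_pm(a, b):
    """The operator from step 4: -1 iff both operands are -1, else 1."""
    return -1 if (a == -1 and b == -1) else 1

def accepts(fx, gy, gz, gv, gw):
    """Acceptance condition f(x) g(y) g(z) (g(v) AND g(w)) = 1 on +/-1 values."""
    return fx * gy * gz * and_pm(gv, gw) == 1

def tsa(x1, x2, x3, x4, x5):
    """1 + x1 + x2 + x3 + x4*x5 over F_2."""
    return (1 + x1 + x2 + x3 + x4 * x5) % 2

def to_pm(bit):
    """Map an F_2 value b to (-1)^b."""
    return -1 if bit else 1

# The test accepts exactly when the TSA polynomial evaluates to 1.
all_agree = all(
    accepts(*(to_pm(b) for b in bits)) == (tsa(*bits) == 1)
    for bits in product((0, 1), repeat=5)
)
```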


The completeness is straightforward. In this case, the Label-Cover instance has a perfect 
labeling. Setting the functions to be the Low-Degree-Long-Code encoding of the labels 
gives an assignment that satisfies all TSA constraints. 

In the soundness case, suppose that there exist some odd $t \le (\log n)^b$ assignments that satisfy in
superposition a 15/16 fraction of the constraints. That is, for each $u_1 \in U$ and $u_2 \in V$,
there are t functions that are folded over constant, $f_{u_1}^{(1)}, \dots, f_{u_1}^{(t)} : P_{r,d} \to \{-1,1\}$ and
$g_{u_2}^{(1)}, \dots, g_{u_2}^{(t)} : P_{3r,d} \to \{-1,1\}$, such that over a random sample of edges $\{u_1,u_2\}$ and
queries $x,y,z,v,w$, with probability at least 15/16, the number of $i \in [t]$ such that
$f^{(i)}(x)g^{(i)}(y)g^{(i)}(z)(g^{(i)}(v) \wedge g^{(i)}(w)) = 1$ is odd. By an averaging argument, we have
that for at least 3/4 of the edges, over a random sample of queries, the above holds with
probability at least 3/4. Call such an edge good.
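The averaging step can be spelled out as follows. The symbols $p_e$ (the probability, over the queries, that the odd-covering condition holds on edge e) and $\gamma$ (the fraction of good edges) are notation introduced here for illustration only.

```latex
\[
\frac{15}{16} \;\le\; \mathop{\mathbf{E}}_{e}\,[p_e]
\;\le\; \gamma \cdot 1 + (1-\gamma)\cdot\frac{3}{4}
\;=\; \frac{3}{4} + \frac{\gamma}{4}
\quad\Longrightarrow\quad
\gamma \;\ge\; \frac{3}{4}.
\]
```

Here a good edge contributes at most 1 and every other edge contributes less than 3/4, which is all the argument uses.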

We assume that the functions are folded in the same way. Recall that when applying
folding, we partition the domain of the functions into equivalence classes, define the
function value in one of the equivalence classes, and then extend to the full domain by
adding appropriate constants. For our reduction, we identify one equivalence class for each
vertex, and the t functions associated with it supply values only for that equivalence class.
This is to make sure that $f^{(1)}, \dots, f^{(t)}$ and $g^{(1)}, \dots, g^{(t)}$ correspond exactly to t assignments
in superposition.

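Fix a good edge for now, and we drop the subscripts $u_1$ and $u_2$. Then we have the
following:

\[ \Pr_{x,y,z,v,w}\Bigl[\prod_{i=1}^{t} f^{(i)}(x)\,g^{(i)}(y)\,g^{(i)}(z)\,\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr) = 1\Bigr] \ge \frac{3}{4}, \]

or

\[ \mathop{\mathbf{E}}_{x,y,z,v,w}\Bigl[\prod_{i=1}^{t} f^{(i)}(x)\,g^{(i)}(y)\,g^{(i)}(z)\,\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr] \ge \frac{1}{2}. \]
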
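Let $f' = \prod_{i=1}^{t} f^{(i)}$ and $g' = \prod_{i=1}^{t} g^{(i)}$. Since t is odd, we have that $f'$ and $g'$ are both
folded over constant. Taking the Fourier expansion of $f'$ and $g'$, we have the following:

\[
\begin{aligned}
\frac{1}{2} &\le \mathop{\mathbf{E}}_{x,y,z,v,w}\Bigl[\prod_{i=1}^{t} f^{(i)}(x)\,g^{(i)}(y)\,g^{(i)}(z)\,\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr] \\
&= \mathop{\mathbf{E}}_{x,y,z,v,w}\Bigl[f'(x)\,g'(y)\,g'(z)\prod_{i=1}^{t}\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr] \\
&= \sum_{\alpha \in \Lambda_{r,d},\ \beta_1,\beta_2 \in \Lambda_{3r,d}} \hat f'_{\alpha}\,\hat g'_{\beta_1}\,\hat g'_{\beta_2}
 \mathop{\mathbf{E}}_{x,y,v,w}\Bigl[\chi_{\alpha}(x)\,\chi_{\beta_1}(y)\,\chi_{\beta_2}(x\circ\pi + y + vw)\prod_{i=1}^{t}\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr] \\
&= \sum_{\beta \in \Lambda_{3r,d}} \hat f'_{\pi_2(\beta)}\,\hat g'^{\,2}_{\beta}
 \mathop{\mathbf{E}}_{v,w}\Bigl[\chi_{\beta}(vw)\prod_{i=1}^{t}\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr].
\end{aligned}
\]
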

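Applying Cauchy-Schwarz and using Parseval, we have

\[
\begin{aligned}
\frac{1}{4} &\le \Bigl(\sum_{\beta \in \Lambda_{3r,d}} \hat g'^{\,2}_{\beta}\Bigr)
\Bigl(\sum_{\beta \in \Lambda_{3r,d}} \hat f'^{\,2}_{\pi_2(\beta)}\,\hat g'^{\,2}_{\beta}
 \mathop{\mathbf{E}}_{v,w}\Bigl[\chi_{\beta}(vw)\prod_{i=1}^{t}\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr]^2\Bigr) \\
&\le \sum_{\beta \in \Lambda_{3r,d}:\ \mathrm{wt}(\beta) \le 2^{d-4}} \hat f'^{\,2}_{\pi_2(\beta)}\,\hat g'^{\,2}_{\beta}
 \mathop{\mathbf{E}}_{v,w}\Bigl[\chi_{\beta}(vw)\prod_{i=1}^{t}\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr]^2 \\
&\quad + \sum_{\beta \in \Lambda_{3r,d}:\ \mathrm{wt}(\beta) > 2^{d-4}} \hat f'^{\,2}_{\pi_2(\beta)}\,\hat g'^{\,2}_{\beta}
 \mathop{\mathbf{E}}_{v,w}\Bigl[\chi_{\beta}(vw)\prod_{i=1}^{t}\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr]^2 .
\end{aligned}
\]

For the terms where $\mathrm{wt}(\beta) > 2^{d-4}$, we apply Theorem 2.27 to get

\[ \Bigl|\mathop{\mathbf{E}}_{v,w}\Bigl[\chi_{\beta}(vw)\prod_{i=1}^{t}\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr]\Bigr| \le 2^{-(2^{d/2-4} - t)/2}, \]

and therefore

\[ \sum_{\beta \in \Lambda_{3r,d}:\ \mathrm{wt}(\beta) > 2^{d-4}} \hat f'^{\,2}_{\pi_2(\beta)}\,\hat g'^{\,2}_{\beta}
 \mathop{\mathbf{E}}_{v,w}\Bigl[\chi_{\beta}(vw)\prod_{i=1}^{t}\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr]^2 \le 2^{-(2^{d/2-4} - t)} \le \frac{1}{8}. \]

This gives us

\[ \sum_{\beta \in \Lambda_{3r,d}:\ \mathrm{wt}(\beta) \le 2^{d-4}} \hat f'^{\,2}_{\pi_2(\beta)}\,\hat g'^{\,2}_{\beta}
\ \ge \sum_{\beta \in \Lambda_{3r,d}:\ \mathrm{wt}(\beta) \le 2^{d-4}} \hat f'^{\,2}_{\pi_2(\beta)}\,\hat g'^{\,2}_{\beta}
 \mathop{\mathbf{E}}_{v,w}\Bigl[\chi_{\beta}(vw)\prod_{i=1}^{t}\bigl(g^{(i)}(v) \wedge g^{(i)}(w)\bigr)\Bigr]^2 \ge \frac{1}{8}. \]
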
Let $\{u_1,u_2\}$ be a good edge. Consider the following labeling strategy: for $u_1$, pick $\alpha$
with probability $\hat f'^{\,2}_{\alpha}$ and pick a random label from $\mathrm{supp}(\alpha)$, and for $u_2$, pick $\beta$ with
probability $\hat g'^{\,2}_{\beta}$ and pick a random label from $\mathrm{supp}(\beta)$. The procedure is well defined because
$f'$ and $g'$ are both folded, and thus by Lemma 2.19 $\mathrm{supp}(\alpha)$ and $\mathrm{supp}(\beta)$ are nonempty.
Also, for $\beta$ such that $\mathrm{wt}(\beta) \le 2^{d-4}$, which is below the weight bound required by Lemma 2.21,
the assignments in $\mathrm{supp}(\beta)$ all satisfy the clauses in $u_2$. Then the probability that the
labeling of $u_1$ and $u_2$ satisfies the projection constraint on a good edge $\{u_1,u_2\}$ is at least

\[ \sum_{\beta:\ \mathrm{wt}(\beta) \le 2^{d-4}} \hat f'^{\,2}_{\pi_2(\beta)}\,\hat g'^{\,2}_{\beta} \cdot \frac{1}{\mathrm{wt}(\beta)} \ \ge\ \frac{1}{8 \cdot 2^{d-4}} \ >\ \frac{1}{100(\log n)^{2b}}. \]

Since at least a 3/4 fraction of the edges are good, overall the labeling satisfies more than a
$(3/4) \cdot (1/(100(\log n)^{2b})) > 1/(1000(\log n)^{2b})$ fraction of the edges, contradicting
the fact that in the soundness case the Label-Cover instance does not have a labeling with
value larger than $1/(1000(\log n)^{2b})$. This completes the proof. □

4 Label Cover with Matrix Labels 

We now use Theorem 3.1 to construct a Label-Cover instance with properties similar
to that in [29]. The proof follows closely the approach in [30], Sections 5-7.

We first give an analogue of Theorem 3.1 over a large field of characteristic 2.

Theorem 4.1. Let q = There is a reduction that takes as input a E3-Sat 

instance of size n, and outputs a system $\mathcal{I}$ of quadratic equations over $\mathbb{F}_q$ of size
with the following properties:

• Each quadratic equation in $\mathcal{I}$ involves exactly 5 variables.

• If the E3-Sat instance is satisfiable, then there is an assignment that satisfies all
equations in $\mathcal{I}$.

• If the E3-Sat instance is unsatisfiable, then for any integer $t \le (\log n)^b$, and any t
assignments, at most a 15/16 fraction of the equations in $\mathcal{I}$ are satisfied in
superposition.

The reduction runs in time 

The proof is almost identical to Theorem 3.4 and Theorem 5.3 of [30], where they proved
that a $(t \cdot \log q)$-superposition hardness with gap $\delta$ for systems of quadratic equations over
$\mathbb{F}_2$ implies t-superposition hardness with the same gap for systems of quadratic equations
over $\mathbb{F}_q$, where $q = 2^r$ and $\mathbb{F}_q$ is an extension field of $\mathbb{F}_2$.

We now construct Label-Cover with matrix labels from the above theorem. The
construction is via a Point vs. Ruled-surface test. The test is very similar to the one in
[30]. The main difference is that here we use the low error version of the Low Degree Test,
instead of the one used by Khot and Saket in [30]. The following theorem appears as
Theorem 5.1 in [2], which follows from the works of Arora and Sudan [6] and Rubinfeld
and Sudan [55].

Theorem 4.2. Let a < 10“^, and d and m be positive integers, and Fg be a field with 
q > lOOd^m. Let f : F™ —>■ Fg be a function, and for every line in¥^, let fi be a univariate 
degree d polynomial. For a point x on the line $\ell$, we denote by $f_\ell(x)$ the value given by $f_\ell$
at the point x.

Suppose that over the choice of a random line $\ell$ and a random point $x \in \ell$, we have
$\Pr_{\ell,x}[f(x) = f_\ell(x)] \ge 1 - \alpha$. Then there is a multivariate polynomial of total degree at
most d which agrees with f on at least a $(1 - 2\alpha)$ fraction of the points.
Let $\mathcal{I}$ be the instance produced by Theorem 4.1 and let $N = 2^{O(\log n \log\log n)}$ be the
number of variables in $\mathcal{I}$. Let $m = \lceil \log N/\log\log N \rceil = O(\log n)$, and $h = \lceil \log N \rceil =
O(\log n \log\log n)$, so that $h^m \ge N$, and let $d := m(h-1) = O(\log^2 n \log\log n)$. The
number of vertices and edges in the Label-Cover instance produced by the reduction
would be $\mathrm{poly}(q^m)$.
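To make the parameter choice concrete, the sketch below computes m, h and d from $\log_2 N$ and checks that $h^m \ge N$, so that the N variables can be embedded in a grid of side h and dimension m. The function name and the use of base-2 logarithms are assumptions for illustration.

```python
import math

def label_cover_parameters(log2_N):
    """Given log2(N), return (m, h, d) with
    m = ceil(log N / log log N), h = ceil(log N), d = m * (h - 1).
    Logarithms are taken base 2 (an assumption)."""
    m = math.ceil(log2_N / math.log2(log2_N))
    h = math.ceil(log2_N)
    d = m * (h - 1)
    return m, h, d

# Example with N = 2**64 variables.
m, h, d = label_cover_parameters(64)
grid_is_large_enough = h ** m >= 2 ** 64
```

The individual degree is $h - 1$ in each of the m coordinates, which is where the total degree bound $d = m(h-1)$ comes from.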

We identify the variables of $\mathcal{I}$ with $S^m$ where $S \subseteq \mathbb{F}_q$ is of size h. Any $\mathbb{F}_q$ assignment
$\sigma$ to the variables of $\mathcal{I}$ can be interpreted as an assignment $\sigma : S^m \to \mathbb{F}_q$ and can be
extended to a corresponding polynomial $g : \mathbb{F}_q^m \to \mathbb{F}_q$ of degree at most $(h-1)$ in each
of the m coordinates. Let $\mathcal{C}$ denote the set of constraints of $\mathcal{I}$, each over $l := 5$ variables.
Denote every constraint $C \in \mathcal{C}$ as $C[\{x_i\}_{i=1}^{l}]$ where $\{x_i\}_{i=1}^{l}$ is the set of points in $S^m$
corresponding to the l variables of $\mathcal{I}$ on which the constraint is defined.


Definition 4.3. A curve $\omega : \mathbb{F}_q \to \mathbb{F}_q^m$ of degree r is a mapping $\omega(t) := (\omega_1(t), \dots, \omega_m(t))$
where each $\omega_j$ is a degree r univariate polynomial in t.

For the rest of this section, fix distinct values $t_0^*, t_1^*, \dots, t_l^* \in \mathbb{F}_q$. A degree l curve $\omega$ is
said to correspond to a constraint $C[\{x_i\}_{i=1}^{l}]$ and an additional point x if $\omega(t_0^*) = x$, and
for $i = 1, \dots, l$, $\omega(t_i^*) = x_i$. We now define the notion of a ruled surface.

Definition 4.4. A ruled surface $R = R[\omega,y]$, where $\omega(t)$ is a curve and $y \in \mathbb{F}_q^m$ is a
direction, is a surface parametrized by two parameters $t, s \in \mathbb{F}_q$, where

\[ R[\omega,y](t,s) = \omega(t) + sy. \]
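The two definitions can be illustrated with a small sketch. It builds the degree-l curve through an additional point x and the l = 5 constraint points by Lagrange interpolation, then forms the ruled surface $\omega(t) + sy$. For simplicity it works over a small prime field rather than the characteristic-2 field $\mathbb{F}_q$ used in the text; the prime P, the sample points and the function names are all hypothetical.

```python
P = 101  # a small prime standing in for the field size q (assumption)

def interpolate_curve(ts, points):
    """Return omega with omega(ts[i]) == points[i], computed coordinate-wise
    by Lagrange interpolation over F_P; its degree is at most len(ts) - 1."""
    def omega(t):
        m = len(points[0])
        result = [0] * m
        for i, (ti, pt) in enumerate(zip(ts, points)):
            # Evaluate the i-th Lagrange basis polynomial at t.
            num, den = 1, 1
            for j, tj in enumerate(ts):
                if j != i:
                    num = num * (t - tj) % P
                    den = den * (ti - tj) % P
            coeff = num * pow(den, -1, P) % P  # modular inverse (Python 3.8+)
            for c in range(m):
                result[c] = (result[c] + coeff * pt[c]) % P
        return tuple(result)
    return omega

def ruled_surface(omega, y):
    """R[omega, y](t, s) = omega(t) + s*y over F_P."""
    def R(t, s):
        return tuple((w + s * yc) % P for w, yc in zip(omega(t), y))
    return R

ts = [0, 1, 2, 3, 4, 5]                            # t_0^*, ..., t_5^*
x = (7, 8)                                         # the additional point
xs = [(1, 2), (3, 4), (5, 6), (9, 10), (11, 12)]   # constraint points x_1..x_5
omega = interpolate_curve(ts, [x] + xs)
R = ruled_surface(omega, y=(1, 1))
```

Setting s = 0 recovers the curve itself, so $R(t_i^*, 0) = x_i$; this is the set of points at which a surface label is later required to satisfy the constraint.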


For a constraint C, a point x and a direction y, let $R[\omega,y]$ be a ruled surface where $\omega$
is the curve of degree l corresponding to C and x. Let $\mathcal{R}_C$ be the class of all such ruled
surfaces corresponding to constraint $C \in \mathcal{C}$, and let $\mathcal{R} := \bigcup_{C \in \mathcal{C}} \mathcal{R}_C$. Suppose the assignment
$g : \mathbb{F}_q^m \to \mathbb{F}_q$ is a global polynomial of degree d. The restriction of g to a ruled surface
$R \in \mathcal{R}$ is a bivariate polynomial, in t and s, of degree at most $d^* := ld = 5d$ in
variable t and at most d in variable s. The total number of coefficients of such a bivariate
polynomial is at most $d^* d = O(d^2)$. The Label-Cover instance $\mathcal{L}$ is constructed as
follows:


Left vertex set U. This consists of all points in $\mathbb{F}_q^m$. The label set is the set of values $\mathbb{F}_q$
that can be assigned to the points.

Right vertex set V. The set of right vertices is $\mathcal{R}$, the class of all ruled surfaces over all
constraints $C \in \mathcal{C}$. The label set of a ruled surface $R \in \mathcal{R}$ is the set of all bivariate
polynomials in t and s of degree at most $d^*$ in t and at most d in s. Such a polynomial
is represented by a vector of its coefficients. As mentioned before, the dimension of
this vector is $O(d^2)$. For a ruled surface R corresponding to a constraint C, there is
a quadratic equation on these coefficients, and a label g is valid, that is $g \in \Gamma(R)$, iff
the values of g at the points $\{(t_i^*, s = 0)\}_{i=1}^{l}$ satisfy C.

Edges. For every ruled surface $R \in \mathcal{R}$ and every point $x \in R$, there is an edge between x
and R. The edge is satisfied by a labeling g to the surface R and a label p to the
point x if $g(x) = p$. Note that the computation of $g(x)$ is linear in the coefficients of
g.

To analyze the above construction, we need the following result from [30].

Observation 4.5. Given an equation C, pick a uniformly random point $x \in \mathbb{F}_q^m$, and let
$\omega$ be the curve corresponding to C and x. Then, for any $t \in \mathbb{F}_q \setminus \{t_1^*, \dots, t_l^*\}$, the point
$\omega(t)$ is uniformly distributed in $\mathbb{F}_q^m$.
Theorem 4.6. Let $k = (\log n)^b$, q = be as in Theorem 4.1 and $\delta$ = .

There is a reduction that takes as input an E3-Sat instance of size n, and outputs a Label-Cover
instance $\mathcal{L}$ with parameters $h, m, d, l, d^*$ as described above. The instance $\mathcal{L}$ has the
following properties:

1. The label set for u £ U is ¥q, and the label set for v € V is the set of vectors of 
length 0{df‘) = 0((logn)®) over Fg. For each edge e = {u,v), the projection vr® map 
coefficient vectors of surface polynomials to their value at points on the surface, and 
are homogeneous and ¥q-linear. The coefficient vector is also supposed to satisfy an 
equation C given by a quadratic equation over¥q. 

The size of C is poly(g™) = 

2. If the E3-Sat instance is satisfiable, then there is a labeling to the vertices of C 
that satisfies all the edges, and that the label for every ruled surface R satisfies the 
associated quadratic equation. 

3. If the E3-Sat instance is unsatisfiable, then the following cannot hold simultaneously: 

• For every left vertex x € F™, there are k labels pf, • • • ,p^ £¥q. 

• For every right vertex R £lZ, there are k labels (polynomials given as coefficient 
vectors) gf, ■ ■ ■ ,gjf, such that the associated equation is satisfied in superposi¬ 
tion by those k labels. 

• For (l — Yo^) ffioLction of the edges of C, between a point x and a ruled surface 
R, we have 

gf{x)=p^, 'ij £{!,...,k} . 

4 . 5-Smoothness: For any surface R, let g be a non-zero label. Then 

Pr [g{x) = 0] < (5. 

x£R 


Proof. The reduction is described as above. Let $\mathcal{I}$ be a system of quadratic equations over
$\mathbb{F}_q$ given by Theorem 4.1 and let $\mathcal{C}$ be the set of equations.

The $\delta$-smoothness property follows from the Schwarz-Zippel lemma: any non-zero label
of a surface R gives a non-zero polynomial g, and its evaluation at a random point on the
surface is zero with probability at most $d/q \le \delta$.

In the completeness case, there is an assignment to each variable in $\mathcal{I}$. Therefore, there
is a degree d polynomial $g : \mathbb{F}_q^m \to \mathbb{F}_q$ that gives this assignment to the corresponding
points in $S^m$. The left vertices are labeled using the assignments given by g, and each ruled
surface R in the right vertex set is labeled by the polynomial given by the restriction
of g to R. This assignment satisfies the mapping on the edges, as well as the quadratic
equation associated with R.

For the soundness case, suppose for contradiction that no k assignments satisfy more 
than a 15/16 fraction of equations in Z, but there are k labelings for the vertices of C, such 
that all associated equations of the right vertices are satisfied in superposition, and the 
mappings on the edges are satished for a (l — fraction of the edges. By an averaging 

argument, we have that for a 19/20 fraction of the equations $C \in \mathcal{C}$, we have

\[ \Pr_{R \in \mathcal{R}_C,\ x \in R}\Bigl[\bigwedge_{j=1}^{k} \bigl(g_j^R(x) = p_j^x\bigr)\Bigr] \ge 1 - \frac{1}{50k}. \]

We say that the equations C that satisfy the condition above are good. Fix one good
equation C. We say that a line $\ell(s)$ is contained in a ruled surface $R(t,s)$ if it is obtained
by fixing a value of t in $R(t,s) = \omega(t) + sy$. Since choosing a random point on a ruled
surface is equivalent to choosing a random line $\ell$ contained in R and then choosing a random
point on $\ell$, the above probability can be rewritten as

\[ \Pr_{R \in \mathcal{R}_C,\ \ell \in R,\ x \in \ell}\Bigl[\bigwedge_{j=1}^{k} \bigl(g_j^R(x) = p_j^x\bigr)\Bigr] \ge 1 - \frac{1}{50k}. \]

From Observation 4.5, we can see that the above probability is essentially equal to the
probability obtained by first picking a random line $\ell$ and then a random $R \in \mathcal{R}_C$ containing
the line. Let $\mathcal{R}_C^{\ell}$ be the set of ruled surfaces $R \in \mathcal{R}_C$ that contain the line $\ell$. Thus we have

\[ \Pr_{\ell,\ x \in \ell,\ R \in \mathcal{R}_C^{\ell}}\Bigl[\bigwedge_{j=1}^{k} \bigl(g_j^R(x) = p_j^x\bigr)\Bigr] \ge 1 - \frac{1}{50k}. \tag{3} \]



We now argue that for each j = 1,... ,k, there exists a unique polynomial Pj of total 
degree at most d, such that 


Pr 

R&'R-c 

x£R 


k 





( 4 ) 


Define $f_1, \dots, f_k : \mathbb{F}_q^m \to \mathbb{F}_q$ as $f_j(x) = p_j^x$ for all $x \in \mathbb{F}_q^m$ and $j \in \{1, \dots, k\}$. For each
line $\ell$, construct $H(\ell) = (h_1(\ell), \dots, h_k(\ell))$ as follows: first choose a random $R \in \mathcal{R}_C^{\ell}$, and
let $h_j(\ell)$ be the univariate restriction of the bivariate polynomial $g_j^R$ to the line $\ell$. Let E
be the following event (over the choice of $\ell$, $x \in \ell$, and H):

\[ E := \bigwedge_{j=1}^{k} \bigl(f_j(x) = h_j(\ell)(x)\bigr). \]

Equation (3) can be re-stated as $\Pr_{\ell, x \in \ell, H}[E] \ge 1 - \frac{1}{50k}$. The choice of H is independent
of $\ell$ and x, therefore there is a deterministic setting of H for which $\Pr_{\ell, x \in \ell}[E] \ge 1 - \frac{1}{50k}$.

Applying Theorem 4.2 we obtain polynomials $P_1, \dots, P_k : \mathbb{F}_q^m \to \mathbb{F}_q$, each of total degree
at most d, such that for each $j \in \{1, \dots, k\}$, we have

\[ \Pr_x[f_j(x) \ne P_j(x)] = \Pr_x[p_j^x \ne P_j(x)] \le \frac{1}{25k}. \]

Note that the choice of polynomials is globally unique, that is, we get the same set of
polynomials regardless of which good equation we fix, due to the Schwarz-Zippel lemma and the fact that
$d/q \ll O(1/k)$.

Observe that a random point on a random ruled surface is essentially distributed
uniformly at random in $\mathbb{F}_q^m$. Therefore, using Equation (3) and the above, along with a union
bound over the k polynomials, we get Equation (4).

From Equation (HD, we have that there is one ruled surface R G TZc such that 


k 


A (A'f (®) = 



Pr 

x£R 
Using Schwarz-Zippel again, we have that for each $j \in \{1, \dots, k\}$, the polynomial $g_j^R$ must
be the restriction of $P_j$ to the surface R, and thus $P_j$ is consistent with $g_j^R$ at the points
$(t_1^*, 0), \dots, (t_l^*, 0)$. This means that the values given by $P_1, \dots, P_k$ satisfy the equation
C in superposition. This holds for each good equation C, and therefore the assignments
given by $P_1, \dots, P_k$ satisfy in superposition a 19/20 fraction of all quadratic equations in
$\mathcal{I}$, contradicting the assumption about $\mathcal{I}$ that no k assignments satisfy in superposition
more than a 15/16 fraction of the equations. □

As in [29], we abstract out the above Label-Cover and get the following statement 
for Label-Cover with F 2 vector labels. Moreover, we boost the soundness by taking 
0{k^) rounds of parallel repetition and apply Theorem 12.121 It is important that we 
apply the parallel repetition theorem from [33|, because the label size here is large. Unlike 
earlier versions of parallel repetition theorem, the statement from [33] is independent of 
the label size and therefore is efficient for our purpose. Note that the smoothness property 
is preserved by parallel repetition. 

Theorem 4.7. Let k = (logn)^ he as in Theorem \4.1[ and let 5 = , There is a 

reduction that takes as input a E3-Sat instance of size n, and outputs a Label-Cover 
instance L with the following properties: 

1. The label set for U isF™', and the label set for V isF^’', where mi,nir = (logn)^^'''*^^^^. 

For each vertex in V, there is a set of quadratic equations over ¥2 associated with it, 
and the coefficient vector is supposed to satisfy all the associated equations. 

The size of the vertex set of C is . 

For each edge e, the mapping tt® is homogeneous and ¥ 2 -linear in the entries of the 
vectors, that is, there is a matrix such that for x G F™* and y G F^’', 

TT^^iy) = X iff x = Aey. 

2. If the E3-Sat instance is satisfiable, then there is a labeling to the vertices of C, that 
satisfies all the edges, and that the label to the right vertices satisfy all associated 
quadratic equations. 

3. If the E3-Sat instance is unsatisfiable, then the following cannot hold simultaneously: 

• For every left vertex u € U, there are k labels • • • , 

• For every right vertex v gV, there are k labels y\, - ■ ■ , y/ that satisfy in super¬ 
position all the associated equations. 

• For fraction of the edges e = {u,v) of C, we have that for all 

j £{!,■■■ ,k}, 7r®(yJ) = x“. 

4 . 5-Smoothness: For any v G V and non-zero label y'", over the choice of a random 
edge incident on v, we have 

Pr =0] <5. 


Proof Sketch. The first step is to translate the Label-Cover from Theorem 14.61 with Fg 
vector labels to one with F 2 vector labels. This is done by choosing an arbitrary 
vector representation for elements of Fg. This gives a F 2 -vector-labeled Label-Cover 
instance with the same guarantees as in Theorem 14.61 but with label sets F™* and F^’’ for 
U and V, respectively, and mi,mr = (logn)^^^^^\ 
The second part of the reduction is to apply Theorem 2.12, where we take the number of
rounds n in Theorem 2.12 to be such that the repeated game has soundness
This requires n = 0{k^). This gives the final Label-Cover instance, and the procedure 
increases $m_l$ and $m_r$ by a multiplicative factor of n.

To see that smoothness is preserved by parallel repetition, suppose we have a vertex
$v := (v_1, \dots, v_t)$ and a non-zero label $y := (y_1, \dots, y_t)$ for the t-round repeated game. Then
there exists $s \in \{1, \dots, t\}$ such that $y_s \ne 0$. Also, for a uniformly random neighbor
$u := (u_1, \dots, u_t)$ of v, the vertex $u_s$ is also a uniformly random neighbor of $v_s$. Note
that a necessary condition for $\pi^{(u,v)}(y) = 0$ is that $\pi^{(u_s,v_s)}(y_s) = 0$, and this probability is
upper-bounded by the smoothness property of Theorem 4.6. □

We now construct the final Label-Cover with matrix labels as follows. 

Theorem 4.8. Let k = (logn)^ be as in Theorem \4-l\ There is a reduction that takes 
as input a E3-Sat instance of size n, and outputs a Label-Cover instance with the 
following properties: 

1. The label set for the left vertex set U is (md the label set for the right 

vertex set V is where mi,mr = (log. 

For each right vertex, there is a set of homogeneous linear F 2 equations involving 
entries of the labeling ofv. The set of valid labelings r(u) consists of matrices that 
satisfy all the associated linear equations. 

The size of the vertex set of C is . 

For each edge e = {u, v), there is a matrix Ag, such that labeling Mu and My satisfies 
vr® iff My = AMyA^. Note that the constraint is linear in the entries of My and My. 

2. If the E3-SAT instance is satisfiable, then there is a labeling to the vertices of C that 
satisfies all the edges, and that for each right vertex v & V, its label My is symmetric, 
rank(M^) = 1, the (1,1) entry of My is 1, and My G r(n). 

3. If the E3-Sat instance is unsatisfiable, then for any labeling a for the vertices in U 
and V, the following cannot hold simultaneously: 

• For each V € V, the matrix a{v) is pseudo-quadratic, /las rank((T(u)) < (logn)^/2, 
and is valid a{v) G r(u). 

• For at least fraction of the edges e = {u, v) of C, we have that 7r^{a{v)) = 

a{u). 

4 . Smoothness: For any v G V and My such that DffMy) 0, then over the choice of 
a random edge incident on v, we have 

Pr [Di{TT^'^’'’\My) = 0)1 < . 

W^V 

Proof. We start with the Label-Cover instance from the previous theorem.

The underlying bipartite graph of the new instance is exactly the same. The parameters
$m_r$ and $m_l$ are the same as before. The labels for $u \in U$ in the new instance are now
matrices from $\mathbb{F}_2^{(m_l+1)\times(m_l+1)}$ and the labels for $v \in V$ are from $\mathbb{F}_2^{(m_r+1)\times(m_r+1)}$. The
constraints for labelings of vertices $v \in V$ are the following:
1. The matrix label M is symmetric, and for $i = 2, \dots, m_r + 1$, we have $M_{1,i} = M_{i,i} =
M_{i,1}$. These are all homogeneous linear constraints. Note that if in addition we have
$M_{1,1} = 1$, then we get that M is pseudo-quadratic. Here, however, we do not include
the latter constraint as it is not homogeneous. In fact, this will be handled by the
inner verifier.


2. For each quadratic constraint in the previous instance, we include the linearized
version of it in the new instance. That is, the term $x_i x_j$ is replaced by entry $(i+1, j+1)$
of the matrix, the term $x_i$ is replaced by entry $(1, i+1)$, and the constant 1 is replaced by
entry $(1,1)$.
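The linearization can be checked on a small example: for a rank-one matrix $M = (1, x)(1, x)^T$ over $\mathbb{F}_2$, the linearized equation evaluates to the same value as the original quadratic equation at x. The sketch below uses 0-based indices, so entry (1,1) of the text is `M[0][0]`; the equation encoding and function names are hypothetical.

```python
def outer_f2(v):
    """Outer product v v^T over F_2."""
    return [[(a * b) % 2 for b in v] for a in v]

def eval_quadratic(const, linear, quad, x):
    """Evaluate const + sum_{i in linear} x_i + sum_{(i,j) in quad} x_i x_j over F_2.
    Indices are 0-based positions in x."""
    val = const
    val += sum(x[i] for i in linear)
    val += sum(x[i] * x[j] for (i, j) in quad)
    return val % 2

def eval_linearized(const, linear, quad, M):
    """Same equation after replacing x_i x_j by M[i+1][j+1], x_i by M[0][i+1],
    and the constant 1 by M[0][0] (0-based rows and columns)."""
    val = const * M[0][0]
    val += sum(M[0][i + 1] for i in linear)
    val += sum(M[i + 1][j + 1] for (i, j) in quad)
    return val % 2

x = [1, 0, 1, 1, 1]
M = outer_f2([1] + x)
# A TSA-style equation: 1 + x_1 + x_2 + x_3 + x_4 x_5.
eq = (1, {0, 1, 2}, {(3, 4)})
```

The matrix M built this way is also symmetric with $M_{1,i} = M_{i,i}$, matching the homogeneous constraints in item 1, since $x_i^2 = x_i$ over $\mathbb{F}_2$.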


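For an edge e, let $A'_e$ be the matrix in the Label-Cover instance from Theorem 4.7. Then
we define the matrix for the new instance to be

\[ A_e = \begin{pmatrix} 1 & 0 \\ 0 & A'_e \end{pmatrix}. \]

Let $M_v = \begin{pmatrix} a & \alpha \\ \beta & D \end{pmatrix}$ be a label. Then it is mapped to

\[ A_e M_v A_e^{T} = \begin{pmatrix} a & \alpha A'^{\,T}_e \\ A'_e \beta & A'_e D A'^{\,T}_e \end{pmatrix}. \]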
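As a sanity check of the block form above, the sketch below verifies on a toy example that a rank-one label $(1, y)(1, y)^T$ is mapped by $M \mapsto A_e M A_e^T$ to the rank-one label $(1, A'_e y)(1, A'_e y)^T$, which is what the completeness case relies on. The specific matrix `A_prime` and vector `y` are hypothetical values chosen for illustration.

```python
def matmul_f2(A, B):
    """Matrix product over F_2."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) % 2
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(r) for r in zip(*A)]

def lift(A_prime):
    """Build the block matrix A_e = [[1, 0], [0, A'_e]]."""
    cols = len(A_prime[0])
    return [[1] + [0] * cols] + [[0] + list(row) for row in A_prime]

def rank_one(y):
    """The matrix (1, y)(1, y)^T over F_2."""
    v = [1] + list(y)
    return [[(p * q) % 2 for q in v] for p in v]

A_prime = [[1, 0, 1], [0, 1, 1]]   # hypothetical 2x3 projection matrix
y = [1, 1, 0]                       # hypothetical vector label of v
A_e = lift(A_prime)
M_v = rank_one(y)
M_u = matmul_f2(matmul_f2(A_e, M_v), transpose(A_e))
x = [sum(r[k] * y[k] for k in range(3)) % 2 for r in A_prime]   # x = A'_e y
```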


In the completeness case, a vector label $\alpha$ in the previous theorem is transformed into
the matrix label $(1, \alpha)(1, \alpha)^{T}$.

For the soundness case, suppose that there are pseudo-quadratic matrices Mu and My 
for each u G U and u € F, such that My satisfies homogeneous linear constraints associated 
with V, lank^My) < k, and that for fraction of the edges e, 7r®(M„) = My. 

For vertex u E F, by Lemma [T9l there exists odd integer I < 3/2 • (log n)^/2 < (logra)^ 
vectors S where y'= 1 for i E [/], such that My = Yl\=iV'i ® v'v 

and the assignments yi := Di(yi), ■ ■ ■ ,yi := Di{yi) satisfy in superposition the quadratic 
constraints of the Label-Cover instance from Theorem 14.71 By padding 0 assignments, 
we can make sure that we have exactly (logn)^ assignments yi, • • • , y(iogn)*> satisfy in 
superposition the quadratic constraints of the Label-Cover instance from Theorem 14.71 
and for all j E [(logn)^], yj is in the column space of Di{My). 

For the decoding of vertices in U, we use the following lemma from [30] (Lemma 7.3), 
adapted to our choice of parameters. The proof is identical and we include it here for sake 
of completeness. 


Lemma 4.9. Fix any u E F, a rank parameter I < (logn)^, and a matrix M E 

sueh that rank(i7i(M)) = 1. Then over the ehoiee of a random neighbor u of v, we have 

rank(i7i(7r(“’’') (M))) = I except with probability 


Proof. Using Lemma 2.1 of [30], Di{M) can be decomposed into the canonical form 

S t 

Di{M) = ^ (g) Zj -h ^ (Zs+2j-l O Zs+2j + Zs+2j <8> Zs+2j-l) , 

i=i i=i 

where I = s -|- 2t is the rank of Di(M), and zi,--- ,zi are linearly independent. For 
j = 1, - ■ ■ ,1, define z' := Then Di{tt^^’'"\M)) can be written as 

S t 

Di{tT^^’^\M)) ='^z'j 0 z'j + + 4+2i ® ^^+2j-l) > 

i=i i=i 

Consider a non-zero linear combination z of the vectors {znV- ,. The vector A', .z is the 

L J J j = l 

corresponding linear combination of the vectors . ) s-iid is non-zero with probability 
2 (logn)*'+3 smoothness guarantee from Theorem 4.7. Taking a union bound over

all 2 ^ — 1 non-zero linear combination, we conclude that except with probability at most 

, the vectors are also linearly independent and spans the column space 

of T»i( 7 r(“’^)(M)). □ 

The smoothness property of the new Label-Cover instance follows easily from the 
above lemma. 

For each u, we choose (logn)^ uniformly random vectors xi,--- from the 

column space of Di{Mu). Now we analyze the expected value of this assignment. 

Note that here the soundness parameter S> 2“^'°®”^*’^^. Therefore, by the 

above lemma, for fraction of the edges e = {u,v), we have rank(Di(Mi,)) = 

rank(Di(7r®(Mt,))) = rank(L)i(74eMi;^^)) = TSink{A'^Di{My)A'J'). 

Fix such an edge. Since rank(Di(My)) = icank{A'^Di{Mv)A'^) < Tank{A'gDi{Mv)) < 
rank(Di(Mu)), this means that rank{A'^Di{M^)A'^) = j:ank{A'^Di{M^)) = rank(Di(My)). 
For any y that is in the column space of Di{My), A'^y is in the column space of A'^Di{My), 
and since rank( 74 gDi(My)) = rank{A'^Di{My)A'^), we conclude that A'^y is also in the 
column space of A'^Di{My)A'J = Di{My). Thus for edge e, with probability at least 
2 “('°S”')^*’, we get vr(yj) = A'^^yj = Xj for all j € {1, ■ ■ ■ , 1 }. 

Overall, this labeling satisfies fraction of the 

edges in the old instance. □ 

5 Hypergraph Coloring Hardness 

We now compose the Label-Cover from Theorem 4.8 with a Quadratic-Code inner-verifier
to get an inapproximability result for hypergraph coloring.

Theorem 5.1. There is a reduction that takes as input an E3-Sat instance of size n, and
outputs an 8-uniform hypergraph H with the following properties:

• The size of the hypergraph H and the running time of the reduction are both upper- 
bounded 62 /exp((log 

• If the E3-Sat instance is satisfiable, then H is 2-colorable. 

• If the E3-Sat instance is unsatisfiable, then H does not have independent set of 

fractional size larger than ). 

In other words, it is quasi-NP-hard to color a 2-colorable 8-uniform hypergraph of size N 
with less than colors. 

The following proof is based on a note by Girish Varma [37].

Given the Label-Cover instance from Theorem l4.81 we expect for each vertex v €zV a 
function fy : ^ The expected encoding for matrix label a{v) = ay®ay 

is fv{A) = {oy ®ay,A) = ayAoy. Let Hy C dual of the subspace of 

the set of pseudo-quadratic matrices that satisfies the linear constraints associated with v. 
The function fy is folded over 

Consider the following Boolean 8-uniform test:

• Choose $u \in U$ uniformly at random, and $v, w \in V$ uniformly and independently at
random from the neighbors of u. Let $\pi, \sigma : \mathbb{F}_2^{(m_r+1)\times(m_r+1)} \to \mathbb{F}_2^{(m_l+1)\times(m_l+1)}$ be the
projections corresponding to the edges $(u,v)$ and $(u,w)$ respectively, and let $S_\pi$ and
$S_\sigma$ be the index sets associated with them.

• Uniformly and independently sample $X_1, X_2, Y_1, Y_2 \in \mathbb{F}_2^{(m_r+1)\times(m_r+1)}$, $F \in \mathbb{F}_2^{(m_l+1)\times(m_l+1)}$,
and $x, y, z, x', y', z' \in \mathbb{F}_2^{m_r+1}$. Let $e \in \mathbb{F}_2^{m_r+1}$ be the vector with only the 1-st entry 1
and the rest 0.

• Accept if and only if the following 8 values are not all equal:

    $f_v(X_1)$   $f_v(X_3)$   where $X_3 := X_1 + x \otimes y + F \circ \pi$
    $f_v(X_2)$   $f_v(X_4)$   where $X_4 := X_2 + (x + e) \otimes z + F \circ \pi$
    $f_w(Y_1)$   $f_w(Y_3)$   where $Y_3 := Y_1 + x' \otimes y' + F \circ \sigma + e \otimes e$
    $f_w(Y_2)$   $f_w(Y_4)$   where $Y_4 := Y_2 + (x' + e) \otimes z' + F \circ \sigma + e \otimes e$

We denote by $\mathcal{T}$ the test distribution.

The vertex set of the hypergraph has size 

exp((logn)(5+Yi))fe)) . = exp((log =: N . 

5.1 Completeness 

Let $y_v \otimes y_v$ for $v \in V$ and $x_u \otimes x_u$ for $u \in U$ be a perfect labeling for the Label Cover
instance, with $y_{v,1} = x_{u,1} = 1$ and for each edge $e = \{u,v\} \in E$, we have $(y_v)|_{S_e} = x_u$.
Consider the 2-coloring where for each $v \in V$, $f_v(X) = y_v^{T} X y_v = \langle X, y_v \otimes y_v \rangle$. Such
a function is constant over cosets of $\mathcal{H}_v$. Let $x_1 := \langle X_1, y_v \otimes y_v \rangle$, $x_2 := \langle X_2, y_v \otimes y_v \rangle$,
$y_1 := \langle Y_1, y_w \otimes y_w \rangle$, $y_2 := \langle Y_2, y_w \otimes y_w \rangle$, and $f := \langle F, x_u \otimes x_u \rangle$. Note that $\langle F, x_u \otimes x_u \rangle =
\langle F, \pi_{u,v}(y_v \otimes y_v) \rangle = \langle F \circ \pi_{u,v}, y_v \otimes y_v \rangle$. Also, $\langle e \otimes e, y_w \otimes y_w \rangle = \langle e, y_w \rangle = 1$. Therefore, the
values of the 8 queries are

    $x_1$   $x_1 + \langle y_v, x \rangle \langle y_v, y \rangle + f$
    $x_2$   $x_2 + (\langle y_v, x \rangle + 1) \langle y_v, z \rangle + f$
    $y_1$   $y_1 + \langle y_w, x' \rangle \langle y_w, y' \rangle + f + 1$
    $y_2$   $y_2 + (\langle y_w, x' \rangle + 1) \langle y_w, z' \rangle + f + 1$

We finish the proof of the completeness case by a case analysis.

If $\langle y_v, y \rangle = \langle y_w, y' \rangle = 0$, then the differences within the first and the third row sum to 1, which
means that there are different values. Similarly, if $\langle y_v, z \rangle = \langle y_w, z' \rangle = 0$,
then by the same argument there are different values in the second or the
fourth row. The same applies to the case when $\langle y_v, x \rangle = \langle y_w, x' \rangle = 1$, and the case when
$\langle y_v, x \rangle = \langle y_w, x' \rangle = 0$.

Suppose now that $\langle y_v, x \rangle = 1$ and all entries are equal. Then from the second row, we
have that $f = 0$, and from the first row, we get $\langle y_v, y \rangle = 0$. By the discussion above, we
have that $\langle y_w, y' \rangle = 1$, and the third row gives us $\langle y_w, x' \rangle = 1$, but then the two entries on
the last row are different.

Suppose otherwise that $\langle y_v, x \rangle = 0$ and all entries are equal. Then from the first row,
we have $f = 0$, and the second row implies $\langle y_v, z \rangle = 0$. By the discussion above, we must
have $\langle y_w, z' \rangle = 1$, and the last row gives $\langle y_w, x' \rangle = 0$, leaving two different entries in the
third row.

Hence $f_v$ gives a valid 2-coloring of G.
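The case analysis can also be confirmed by brute force. Treating the six inner products, the value f, and the four base values $x_1, x_2, y_1, y_2$ as free $\mathbb{F}_2$ variables, the sketch below enumerates all $2^{11}$ settings and checks that the 8 query values are never all equal. The variable names `a, b, c, a2, b2, c2` are shorthand introduced here for $\langle y_v,x\rangle, \langle y_v,y\rangle, \langle y_v,z\rangle$ and their primed counterparts with $y_w$.

```python
from itertools import product

def query_values(x1, x2, y1, y2, a, b, c, a2, b2, c2, f):
    """The 8 query values in the completeness case, over F_2.
    a, b, c stand for <y_v,x>, <y_v,y>, <y_v,z>; a2, b2, c2 for the
    inner products of y_w with x', y', z'; f for <F, x_u (x) x_u>."""
    return [
        x1, (x1 + a * b + f) % 2,
        x2, (x2 + (a + 1) * c + f) % 2,
        y1, (y1 + a2 * b2 + f + 1) % 2,
        y2, (y2 + (a2 + 1) * c2 + f + 1) % 2,
    ]

# Values lie in {0, 1}, so "not all equal" means exactly two distinct values.
never_monochromatic = all(
    len(set(query_values(*bits))) == 2
    for bits in product((0, 1), repeat=11)
)
```

The check treats the inner products as independent bits, which is at least as general as the dependencies that arise in the actual test.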

5.2 Soundness 

Let 6 = be the soundness parameter from Theorem 14.81 and k = (logn)^/2 be the 

rank upper-bound from Theorem 14.81 
Lemma 5.2. If there is an independent set in G of relative size s, then

\[ s^8 \le \delta + \frac{1}{2^{k/2+1}}. \]


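Proof. Consider any set $A \subseteq V(G)$ of fractional size s. For every $v \in V$, let $f_v :
\mathbb{F}_2^{(m_r+1)\times(m_r+1)} \to [0,1]$ be the indicator function of A, extended such that it is constant
over cosets of $\mathcal{H}_v$. The fractional size of A is given by

\[ \mathop{\mathbf{E}}_{v \in V,\ X \in \mathbb{F}_2^{(m_r+1)\times(m_r+1)}}[f_v(X)] = \mathop{\mathbf{E}}_{v \in V}\bigl[\hat f_{v,0}\bigr] = s. \]

The set A is an independent set if and only if

\[ \Theta := \mathop{\mathbf{E}}_{u,v,w}\ \mathop{\mathbf{E}}_{X_i, Y_i \sim \mathcal{T}}\ \prod_{i=1}^{4} f_v(X_i) f_w(Y_i) = 0. \tag{5} \]
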

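Taking the Fourier expansion and considering expectations over $X_1, X_2, Y_1, Y_2$, we get the
following:

\[
\begin{aligned}
\Theta = \mathop{\mathbf{E}}_{u,v,w} \sum_{\alpha_1,\alpha_2,\beta_1,\beta_2 \in \mathbb{F}_2^{(m_r+1)\times(m_r+1)}}
 \hat f_{v,\alpha_1}^2 \hat f_{v,\alpha_2}^2 \hat f_{w,\beta_1}^2 \hat f_{w,\beta_2}^2
 \mathop{\mathbf{E}}_{F,x,x'}\Bigl[\ & \mathop{\mathbf{E}}_{y}[\chi_{\alpha_1}(x \otimes y)]\,\chi_{\alpha_1}(F \circ \pi) \\
 \cdot\ & \mathop{\mathbf{E}}_{z}[\chi_{\alpha_2}((x+e) \otimes z)]\,\chi_{\alpha_2}(F \circ \pi) \\
 \cdot\ & \mathop{\mathbf{E}}_{y'}[\chi_{\beta_1}(x' \otimes y')]\,\chi_{\beta_1}(F \circ \sigma)\,\chi_{\beta_1}(e \otimes e) \\
 \cdot\ & \mathop{\mathbf{E}}_{z'}[\chi_{\beta_2}((x'+e) \otimes z')]\,\chi_{\beta_2}(F \circ \sigma)\,\chi_{\beta_2}(e \otimes e)\ \Bigr].
\end{aligned}
\]

Denote the summand corresponding to $(\alpha_1, \alpha_2, \beta_1, \beta_2)$ as $\mathrm{Term}_{u,v,w}(\alpha_1, \alpha_2, \beta_1, \beta_2)$.
For the characters involving F, we have

\[ \mathop{\mathbf{E}}_{F}\bigl[\chi_{\alpha_1}(F \circ \pi)\chi_{\alpha_2}(F \circ \pi)\chi_{\beta_1}(F \circ \sigma)\chi_{\beta_2}(F \circ \sigma)\bigr]
 = \mathop{\mathbf{E}}_{F}\Bigl[(-1)^{\langle \pi(\alpha_1+\alpha_2), F \rangle + \langle \sigma(\beta_1+\beta_2), F \rangle}\Bigr], \]

and since $F \in \mathbb{F}_2^{(m_l+1)\times(m_l+1)}$ is chosen uniformly at random, the above is 0 unless $\pi(\alpha_1 +
\alpha_2) = \sigma(\beta_1 + \beta_2)$.

Let $\nu(\alpha) := \langle \alpha, e \otimes e \rangle$. Taking expectations over $x, y, z, x', y', z'$, we have that when
$\pi(\alpha_1 + \alpha_2) \ne \sigma(\beta_1 + \beta_2)$, $\mathrm{Term}_{u,v,w}(\alpha_1, \alpha_2, \beta_1, \beta_2) = 0$, and otherwise

\[
\mathrm{Term}_{u,v,w}(\alpha_1, \alpha_2, \beta_1, \beta_2)
 = (-1)^{\nu(\beta_1+\beta_2)}\, \hat f_{v,\alpha_1}^2 \hat f_{v,\alpha_2}^2 \hat f_{w,\beta_1}^2 \hat f_{w,\beta_2}^2
 \Pr_{x}[\alpha_1 x = 0 \wedge \alpha_2 x = \alpha_2 e]\ \Pr_{x'}[\beta_1 x' = 0 \wedge \beta_2 x' = \beta_2 e].
\]
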
X x' ^ ■' 
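The terms that are potentially non-zero can now be partitioned into three parts:

\[
\begin{aligned}
\Theta_0 &= \mathop{\mathbf{E}}_{u,v,w} \sum_{\substack{\mathrm{rank}(\alpha_1+\alpha_2),\ \mathrm{rank}(\beta_1+\beta_2) \le k \\ \pi(\alpha_1+\alpha_2) = \sigma(\beta_1+\beta_2) \\ \nu(\beta_1+\beta_2) = 0}} \mathrm{Term}_{u,v,w}(\alpha_1, \alpha_2, \beta_1, \beta_2), \\
\Theta_1 &= \mathop{\mathbf{E}}_{u,v,w} \sum_{\substack{\mathrm{rank}(\alpha_1+\alpha_2),\ \mathrm{rank}(\beta_1+\beta_2) \le k \\ \pi(\alpha_1+\alpha_2) = \sigma(\beta_1+\beta_2) \\ \nu(\beta_1+\beta_2) = 1}} \mathrm{Term}_{u,v,w}(\alpha_1, \alpha_2, \beta_1, \beta_2), \\
\Theta_2 &= \mathop{\mathbf{E}}_{u,v,w} \sum_{\substack{\max\{\mathrm{rank}(\alpha_1+\alpha_2),\ \mathrm{rank}(\beta_1+\beta_2)\} > k \\ \pi(\alpha_1+\alpha_2) = \sigma(\beta_1+\beta_2)}} \mathrm{Term}_{u,v,w}(\alpha_1, \alpha_2, \beta_1, \beta_2).
\end{aligned}
\]
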

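We first lower-bound $\Theta_0$. Note that all terms in $\Theta_0$ are non-negative. Consider the term
corresponding to $\alpha_1 = \alpha_2 = \beta_1 = \beta_2 = 0$. We have

\[ \mathop{\mathbf{E}}_{u,v,w}\bigl[\hat f_{v,0}^4\, \hat f_{w,0}^4\bigr] = \mathop{\mathbf{E}}_{u}\Bigl[\bigl(\mathop{\mathbf{E}}_{v}\bigl[\hat f_{v,0}^4\bigr]\bigr)^2\Bigr] \ge s^8. \]

Therefore $\Theta_0 \ge s^8$.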

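For $\Theta_1$, we have the following upper bound:

\[ |\Theta_1| \le \mathop{\mathbf{E}}_{u,v,w} \sum_{\substack{\mathrm{rank}(\alpha_1+\alpha_2),\ \mathrm{rank}(\beta_1+\beta_2) \le k \\ \pi(\alpha_1+\alpha_2) = \sigma(\beta_1+\beta_2) \\ \nu(\beta_1+\beta_2) = 1}} \hat f_{v,\alpha_1}^2 \hat f_{v,\alpha_2}^2 \hat f_{w,\beta_1}^2 \hat f_{w,\beta_2}^2 . \tag{6} \]
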

Consider the following randomized labeling strategy for vertices $u \in U$ and $v \in V$: for
$v \in V$, pick $(\beta_1, \beta_2)$ with probability $\hat f_{v,\beta_1}^2 \hat f_{v,\beta_2}^2$ and set its label to $\beta_1 + \beta_2$; for $u \in U$, pick
a random neighbor v, choose $(\alpha_1, \alpha_2)$ with probability $\hat f_{v,\alpha_1}^2 \hat f_{v,\alpha_2}^2$, and set its label to
$\pi(\alpha_1 + \alpha_2)$. Due to folding, we have that $\beta_1$ and $\beta_2$ both satisfy the homogeneous linear
constraints associated with v, and so does $\beta_1 + \beta_2$. Therefore the right hand side of (6)
gives the probability that a random edge of the Label Cover is satisfied by this labeling.
Thus $|\Theta_1| \le \delta$.

For ©2, note that if rank(o) > k, then for any fixed b, Pra;[ox = 6] < Therefore, 

for any fixed choice of u,v,w, all terms in ©2 have absolute value at most 
Combined with Parseval's identity, we conclude that $|\Theta_2| \le 1/2^{k/2+1}$. □


We conclude that any independent set in Q has fractional size at most 2 
therefore the chromatic number of G is at least 2*°§ = exp((log 